This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 



BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 



BLACK BORDERS 

TEXT CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT 
ILLEGIBLE TEXT 
SKEWED/SLANTED IMAGES 
COLORED PHOTOS 

BLACK OR VERY BLACK AND WHITE DARK PHOTOS 
GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



Certificate of Mailing 

I hereby certify that this correspondence isjjeing deposited with the United States Postal Service as first class mail in an envelope 
addressed to: Mail Stop f£l /^/vJtfr dommissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 on 



Printed: 




Uj cv, 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE O ^ 

In re Application of: Hillman et al. ^ ^ ^ 

i 

Title: NOVEL HUMAN MEMBRANE PROTEIN 



Serial No.: 09/898,216 



Filing Date: July 02, 2001 



Examiner: Yaen.C. Group Art Unit: 1642 

Mail stop l\Jon RjL'fto ru^rrJUut 

Commissioner for Patents 

P.O. Box 1450 

Alexandria, VA 22313-1450 



DECLARATION OF LARS MICHAEL FURNESS 
UNDER 37 C.F.R. § 1.132 



I, L. MICHAEL FURNESS, a citizen of the United Kingdom, residing at 2 Brookside, 
Exning, Newmarket, United Kingdom, declare that: 

1. I was employed by Incyte Genomics, Inc. (hereinafter "Incyte") as a Director 
of Pharmacogenomics until December 31, 2001. I am currently under contract to be a Consultant to 
Incyte Genomics, Inc. 

2. In 1984, I received a B.Sc.(Hons) in Biomolecular Science (Biophysics and 
Biochemistry) from Portsmouth Polytechnic. 

From 1985-1987 1 was at the School of Pharmacy in London, United Kingdom, during 
which time I analyzed lipid methyltransferase enzymes using a variety of protein analysis methods, 
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including one-dimensional (ID) and two-dimensional (2D) gel electrophoresis, HPLC, and a variety of 
enzymatic assay systems. 

I then worked in the Protein Structure group at the National Institute for Medical 
Research until 1989, setting up core facilities for nucleic acid synthesis and sequencing, as well as 
assisting in programs on protein kinase C inhibitors. 

After a year at Perkin Elmer-Applied Biosystems as a technical specialist, I worked at 
the Imperial Cancer Research Fund between 1990-1992, on a Eureka-funded program collaborating 
with Amersham Pharmacia in the United Kingdom and CEPH (Centre d'Etude du Polymorphisme 
Humaine) in Paris, France, to develop novel nucleic acid purification and characterization methods. 

In 1992, 1 moved to Pfizer Central Research in the United Kingdom, where I stayed 
until 1998, initially setting up core DNA sequencing and then a DNA arraying facility for gene 
expression analysis in 1993. My work also included bioinformatics and I was responsible for the 
support of all Pfizer neuroscience programs in the United Kingdom. This then led me into carrying out 
detailed bioinformatics and wet lab work on the sodium channels, including antibody generation, 
Western and Northern analyses, PCR, tissue distribution studies, and sequence analyses on novel 
sequences identified. 

In 1998, 1 moved to Incyte Genomics, Inc., to the Pharmacogenomics group to look at 
the application of genomics and proteomics to the pharmaceutical industry. In 1999, 1 was appointed 
director of the LifeExpress Lead Program which used microarray and protein expression data to 
identify pharmacologically and toxicologically relevant mechanisms to assist in improved drug design 
and development. 

On December 12, 2001 1 founded Nuomics Consulting Ltd., in Exning, U.K., and I am 
currently employed as Managing Director. Nuomics Consulting Ltd. will be providing expert technical 
knowledge and advice to businesses around the areas of genomics, proteomics, pharmacogenomics, 
toxicogenomics and chemogenomics. 

3. I have reviewed the specification of a United States patent application that I 
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understand was filed on July 2, 2001 in the names of Hillman, J. et al and was assigned Serial No. 
09/898,216 (hereinafter "the Hillman '216 application"). Furthermore, I understand that this United 
States patent application was a continuation application of and claimed priority to United States patent 
application Serial No. 09/095,351 filed on June 9, 1998 (hereinafter "the Hillman '351 application"), 
which in turn was a divisional application of and claimed priority to United States patent application 
Serial No. 08/781,562 filed on January 9, 1997 (hereinafter "the Hillman '562 application"), all having 
the identical specification. My remarks herein will therefore be directed to the Hillman '562 patent 
application, and January 9, 1997, as the relevant date of filing. In broad overview, the Hillman '562 
specification pertains to certain nucleotide and amino acid sequences and their use in a number of 
applications, including gene and protein expression monitoring applications that are useful in connection 
with (a) developing drugs (e.g., for the treatment of cancer), and (b) monitoring the activity of drugs for 
purposes relating to evaluating their efficacy and toxicity. 

4. I understand that (a) the Hillman '216 application contains claims that are 
directed to a substantially purified polypeptide having the sequence shown as SEQ ID NO:l 
(hereinafter "the SEQ ID NO:l polypeptide"), and (b) the Patent Examiner has rejected those claims 
on the grounds that the specification of the Hillman '216 application does not disclose a substantial, 
specific and credible utility for the claimed SEQ ID NO:l polypeptide. I further understand that 
whether or not a patent specification discloses a substantial, specific and credible utility for its claimed 
subject matter is properly determined from the perspective of a person skilled in the art to which the 
specification pertains at the time of the patent application was filed. In addition, I understand that a 
substantial, specific and credible utility under the patent laws must be a "real-world" utility. 

5. I have been asked (a) to consider with a view to reaching a conclusion (or 
conclusions) as to whether or not I agree with the Patent Examiner's position that the Hillman '216 
application and its parents, the Hillman '351 and the Hillman '562 applications, does not disclose a 
substantial, specific and credible "real-world" utility for the claimed SEQ ID NO:l polypeptide, and 
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(b) to state and explain the bases for any conclusions I reach. I have been informed that, in connection 
with my considerations, I should determine whether or not a person skilled in the art to which the 
Hillman '562 application pertains on January 9, 1997, would have concluded that the Hillman '562 
application disclosed, for the benefit of the public, a specific beneficial use of the SEQ ID NO:l 
polypeptide in its then available and disclosed form. I have also been informed that, with respect to the 
"real-world" utility requirement, the Patent and Trademark Office instructs its Patent Examiners in 
Section 2107 of the Manual of Patent Examining Procedure, under the heading "L f Real-World Value' 
Requirement": 

"Many research tools such as gas chromatographs, screening assays, and 
nucleotide sequencing techniques have a clear, specific and unquestionable utility (e.g., 
they are useful in analyzing compounds). An assessment that focuses on whether an 
invention is useful only in a research setting thus does not address whether the specific 
invention is in fact 'useful' in a patent sense. Instead, Office personnel must distinguish 
between inventions that have a specifically identified utility and inventions whose 
specific utility requires further research to identify or reasonably confirm." 

6. I have considered the matters set forth in paragraph 5 of this Declaration and 
have concluded that, contrary to the position I understand the Patent Examiner has taken, the 
specification of the Hillman '562 patent application disclosed to a person skilled in the art at the time of 
its filing a number of substantial, specific and credible real-world utilities for the claimed SEQ ID NO:l 
polypeptide. More specifically, persons skilled in the art on January 9, 1997 would have understood 
the Hillman '562 application to disclose the use of the SEQ ID NO:l polypeptide as a research tool in 

a number of gene and protein expression monitoring applications that were well-known at that time to 
be useful in connection with the development of drugs and the monitoring of the activity of such drugs. I 
explain the bases for reaching my conclusion in this regard in paragraphs 7-13 below. 

7. In reaching the conclusion stated in paragraph 6 of this Declaration, I 
considered (a) the specification of the Hillman '562 application, and (b) a number of published articles 
and patent documents that evidence gene and protein expression monitoring techniques that were 
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well-known before the January 9, 1997 filing date of the Hillman '562 application. The published 
articles and patent documents I considered are: 

(a) Anderson, N.L., Esquer-Blasco, R., Hofmann, J.-P., Anderson, N.G., 
A Two-Dimensional Gel Database of Rat Liver Proteins Useful in Gene Regulation and Drug Effects 
Studies , Electrophoresis, 12, 907-930 (1991) (hereinafter "the Anderson 1991 article") (copy annexed 
at Tab A); 

(b) Anderson, N.L., Esquer-Blasco, R., Hofmann, J.-P., Mehues, L., 
Raymackers, J., Steiner, S. Witzmann, F., Anderson, N.G., An Updated Two-Dimensional Gel 
Database of Rat Liver Proteins Useful in Gene Regulation and Drug Effect Studies , Electrophoresis, 16, 
1977-1981 (1995) (hereinafter "the Anderson 1995 article") (copy annexed at Tab B); 

(c) Wilkins, M.R., Sanchez, J.-C., Gooley, A.A., Appel, R.D., 
Humphery-Smith, I., Hochstrasser, D.F., Williams, K.L., Progress with Proteome Projects: Why all 
Proteins Expressed by a Genome Should be Identified and How To Do It , Biotechnology and Genetic 
Engineering Reviews, 13, 19-50 (1995) (hereinafter "the Wilkins article") (copy annexed at Tab C); 

(d) Celis, J.E., Rasmussen, H.H., Leffers, H., Madsen, P., Honore, B., 
Gesser, B., Dejgaard, K., Vandekerckhove, J., Human Cellular Protein Patterns and their Link to 
Genome DNA Sequence Data: Usefulness of Two-Dimensional Gel Electrophoresis and 
Microsequencing, FASEB Journal, 5, 2200-2208 (1991) (hereinafter "the Celis article") (copy 
annexed at Tab D); 

(e) Franzen, B., Linder, S M Okuzawa, K., Kato, H., Auer, G., 
Nonenzvmatic Extraction of Cells from Clinical Tumor Material for Analysis of Gene Expression by 
Two-Dimensional Pol vacrvl amide Gel Electrophoresis , Electrophoresis, 14, 1045-1053 (1993) 
(hereinafter "the Franzen article") (copy annexed at Tab E); 

(f) Bjellqvist, B., Basse, B., Olsen, E., Celis, J.E., Reference Points for 
Comparisons of Two-Dimensional Maps of Proteins from Different Human Cell Types Defined in a pH 
Scale Where Isoelectric Points Correlate with Polypeptide Compositions , Electrophoresis, 15, 529- 
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539 (1994) (hereinafter "the Bjellqvist article") (copy annexed at Tab F); 

(g) Large Scale Biology Company Info; LSB and LSP Information; from 
http://www.lsbc.com (2001) (copy annexed at Tab G); 

8. Many of the published articles I considered (i.e., at least items (a)-(f) identified 
in paragraph 7) relate to the development of protein two-dimensional gel electrophoretic techniques for 
use in protein expression monitoring applications in drug development and toxicology. As I will discuss 
below, a person skilled in the art who read the Hillman '562 application on January 9, 1997 would 
have understood that application to disclose the SEQ ID NO:l polypeptide to be useful for a number 

of protein expression monitoring applications, e.g., in the use of two-dimensional polyacrylamide gel 
electrophoresis and western blot analysis of tissue samples in drug development and in toxicity testing. 

Furthermore, items (a)-(f) establish that protein two-dimensional polyacrylamide gel 
electrophoresis and western blot analysis were well-known and established methods routinely used in 
toxicology testing and drug development at the time of filing the Hillman '562 application and for several 
years prior to January 9, 1997. As such, one of ordinary skill in the art would have recognized that the 
polypeptide of SEQ ID NO: 1 could be used in toxicology testing and drug development, irrespective of 
its biochemical activities. 

9. Turning more specifically to the Hillman '562 specification, the SEQ ID NO:l 
polypeptide is shown at pages 53-54 as one of 7 sequences under the heading "Sequence Listing." 
The Hillman '562 specification teaches that the invention features a substantially purified polypeptide 
having the amino acid sequence shown in SEQ ID NO:l (Hillman '562 application at p. 14, lines 5- 
29). It further teaches that (a) the identity of the SEQ ID NO: 1 polypeptide was determined from a 
Prostate cDNA library, (b) the SEQ ID NO:l polypeptide is the integral membrane protein referred to 
as "IMP" and is encoded by SEQ ID NO:2, and (c) northern analysis shows that IMP mRNA is 
expressed "in prostate tumor (2/13), breast tumor (1/13), and pancreatic tumor libraries (1/13)" and 
therefore IMP appears to be involved in proliferative diseases, such as prostate, breast, and pancreatic 
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cancers (Hillman '562 application at p. 15, lines 23-27, Invention section). Furthermore, based on the 
chemical and structural homology among IMP and stomatin, IMP appears to play a role in the 
regulation of ion channels. Therefore IMP may play a role in ion transport or membrane conductance 
such as hemolytic anemias, as well as prostate, breast, and pancreatic tumors (Hillman '562 application 
at p. 27, line 27, through p. 28, 1. 1, Invention section). 

The Hillman '562 application discusses a number of uses of the SEQ ID NO:l 
polypeptide in addition to its use in protein expression monitoring applications. I have not fully 
evaluated these additional uses in connection with the preparation of this Declaration and do not 
express any views in this Declaration regarding whether or not the Hillman '562 specification discloses 
these additional uses to be substantial, specific and credible real-world utilities of the SEQ ID NO:l 
polypeptide. Consequently, my discussion in this Declaration concerning the Hillman '562 application 
focuses on the portions of the application that relate to the use of the SEQ ID NO: 1 polypeptide in 
protein expression monitoring applications. 

10. The Hillman '562 application discloses that the polynucleotide sequences 
disclosed therein, including the polynucleotides encoding the SEQ ID NO:l polypeptide (antigen used 
to produce the claimed antibodies to SEQ ID NO:l polypeptide), are useful as probes in chip based 
technologies. It further teaches that the chip based technologies can be used "for the detection and/or 
quantification of nucleic acid or protein" (Hillman '562 application at p. 25, lines 18-22). 

The Hillman '562 application also discloses that thelsEQ^IDJSfO: 1 polypeptide or the 
antibody to SEQ ID NO:l polypeptide are useful in other protein expression detection technologies. 
The Hillman '562 application states that "[a] variety of protocols for measuring the expression of IMP, 
using either polyclonal or monoclonal antibodies specific for the respective protein are known in the art. 
Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and 
fluorescence activated cell sorting (FACS)" (Hillman '562 application at p.26, lines 1-4). 

In addition, at the time of filing the Hillman '562 application, it was well known in the art 
that "gene" and protein expression analyses also included two-dimensional polyacrylamide gel 
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electrophoresis (2-D PAGE) technologies, which were developed during the 1980s, and as exemplified 
by the Anderson 1991 and 1995 articles (Tab A and Tab B). The Anderson 1991 article teaches that 
a 2-D PAGE map has been used to connect and compare hundreds of 2-D gels of rat liver samples 
from a variety of studies including regulation of protein expression by various drugs and toxic agents 
(Tab A at p. 907). The Anderson 1991 article teaches an empirically-determined standard curve fitted 
to a series of identified proteins based upon amino acid chain length (Tab A at p. 91 1) and how that 
standard curve can be use in protein expression analysis. The Anderson 1991 article teaches that 
"there is a long-term need for a comprehensive database of liver proteins" (Tab A at p. 912). 

The Wilkins article (Tab C) is one of a number of documents that were published prior 
to the January 9, 1997 filing date of the Hillman '562 application that describes the use of the 2-D 
PAGE technology in a wide range of gene and protein expression monitoring applications, including 
monitoring and analyzing protein expression patterns in human cancer, human serum plasma proteins, 
and in rodent liver following exposure to toxins. In view of the Hillman '562 application, the Wilkins 
article, and other related pre- January 1997 publications, persons skilled in the art on January 9, 1997 
clearly would have understood the Hillman '562 application to disclose the SEQ ID NO:l polypeptide 
or the antibody to SEQ ID NO:l polypeptide to be useful in 2-D PAGE analyses for the development 
of new drugs and monitoring the activities of drugs for such purposes as evaluating their efficacy and 
toxicity, as explained more fully in paragraph 12 below. 

With specific reference to toxicity evaluations, those of skill in the art who were 
working on drug development in January 1997 (and for many years prior to January 1997) without any 
doubt appreciated that the toxicity (or lack of toxicity) of any proposed drug they were working on was 
one of the most important criteria to be considered and evaluated in connection with the development 
of the drug. They would have understood at that time that good drugs are not only potent, they are 
specific. This means that they have strong effects on a specific biological target and minimal effects on 
all other biological targets. Ascertaining that a candidate drug affects its intended target, and 
identification of undesirable secondary effects (i.e., toxic side effects), had been for many years among 
the main challenges in developing new drugs. The ability to determine which genes are positively 
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affected by a given drug, coupled with the ability to quickly and at the earliest time possible in the drug 
development process identify drugs that are likely to be toxic because of their undesirable secondary 
effects, have enormous value in improving the efficiency of the drug discovery process, and are an 
important and essential part of the development of any new drug. In fact, the desire to identify and 
understand toxicological effects using the experimental assays described above led Dr. Leigh Anderson 
to found the Large Scale Biology Corporation in 1985, in order to pursue commercial development of 
the 2-D electrophoretic protein mapping technology he had developed. In addition, the company 
focused on toxicological effects on the proteome as clearly demonstrated by its goals and by its senior 
management credentials described in company documents (see Tab G at pp. 1, 3, and 5). 

Accordingly, the teachings in the Hillman '562 application, in particular regarding use of 
SEQ ID NO:l polypeptide or the antibody to SEQ ID NO:l polypeptide in differential gene and 
protein expression analysis (2-D PAGE maps) and in the development and the monitoring of the 
activities of drugs, clearly includes toxicity studies and persons skilled in the art who read the Hillman 
'562 application on January 9, 1997 would have understood that to be so. 

11. As previously discussed (supra, paragraphs 2 and 7), in the mid-1980s the 
several publications annexed to this Declaration at Tabs A through F evidence information that was 
available to the public regarding two-dimensional polyacrylamide gel electrophoresis technology and its 
uses in drug discovery and toxicology testing before the January 9, 1997 filing date of the Hillman '562 
application. In particular the Celis article stated that "protein databases are expected to foster a variety 
of biological information.... - among others, . . . drug development and testing" (See Tab D, p. 2200, 
second column). The Franzen article shows that 2-D PAGE maps were used to identify proteins in 
clinical tumor material (See Tab E). The Hillman '562 application clearly discloses that expression of 
IMP is associated with cancerous tissues (Hillman '562 application at page 15, lines 23-27). The 
Bjellqvist article showed that a protein may be identified accurately by its positional co-ordinates, 
namely molecular mass and isoelectric point (See Tab F). The Hillman '562 application clearly 
disclosed SEQ ID NO: 1 from which it would have been routine for one of skill in the art to predict both 
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the molecular mass and the isoelectric point using algorithms well known in the art at the time of filing. 
In addition, the antibody to SEQ ID NO:l polypeptide provides another basis for identifying the SEQ 
ID NO:l polypeptide. 

12. A person skilled in the art on January 9, 1997, who read the Hillman '562 
application, would understand that application to disclose the SEQ ID NO:l polypeptide to be highly 
useful in analysis of differential expression of proteins. For example, the specification of the Hillman 
'562 application would have led a person skilled in the art in January 1997 who was using protein 
expression monitoring in connection with working on developing new drugs for the treatment of cancers 
to conclude that a 2-D PAGE map that used the substantially purified SEQ ID NO: 1 polypeptide 
would be a highly useful tool and to request specifically that any 2-D PAGE map that was being used 
for such purposes utilize the SEQ ID NO:l polypeptide sequence. Expressed proteins are useful for 2- 
D PAGE analysis in toxicology expression studies for a variety of reasons, particularly for purposes 
relating to providing controls for the 2-D PAGE analysis, and for identifying sequence or post- 
translational variants of the expressed sequences in response to exogenous compounds. Persons 
skilled in the art would appreciate that a 2-D PAGE map that utilized the SEQ ID NO:l polypeptide 
sequence would be a more useful tool than a 2-D PAGE map that did not utilize this protein sequence 
in connection with conducting protein expression monitoring studies on proposed (or actual) drugs for 
treating cancers for such purposes as evaluating their efficacy and toxicity. 

I discuss in more detail in items (a)-(b) below a number of reasons why a person skilled 
in the art, who read the Hillman '562 specification in January 1997 would have concluded based on 
that specification and the state of the art at that time, that SEQ ID NO:l polypeptide would be a highly 
useful tool for analysis of a 2-D PAGE map for evaluating the efficacy and toxicity of proposed drugs 
for various cancers by means of 2-D PAGE maps, as well as for other evaluations: 

(a) The Hillman '562 specification contains a number of teachings that would 
lead persons skilled in the art on January 9, 1997 to conclude that a 2-D PAGE map that utilized the 
substantially purified SEQ ID NO:l polypeptide would be a more useful tool for protein expression 
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monitoring applications relating to drugs for treating various cancers than a 2-D PAGE map that did not 
use the SEQ ID NO:l polypeptide sequence. Among other things, the Hillman '562 specification 
teaches that (i) the identity of the SEQ ED NO:l polypeptide was determined from a "Prostate cDNA 
library," (ii) the SEQ ID NO:l polypeptide is the integral membrane protein referred to as IMP, and 
(iii) IMP is expressed in various libraries derived from prostate, breast, and pancreatic tumor tissues, 
and, therefore, "IMP appears to be involved in cell proliferative disorders, and to play a role in 
cancers" (Hillman '562 application at p. 28, lines 3-5, Invention section; see paragraph 9, supra). The 
substantially purified polypeptide could therefore be used as a control to more accurately gauge the 
expression of IMP in the sample and consequently more accurately gauge the affect of a toxicant on 
expression of the gene. (b) Persons skilled in the art on January 9, 1997 would have 

appreciated (i) that the protein expression monitoring results obtained using a 2-D PAGE map that 
utilized a SEQ ID NO:l polypeptide would vary, depending on the particular drug being evaluated, and 
(ii) that such varying results would occur both with respect to the results obtained from the SEQ ID 
NO:l polypeptide and from the 2-D PAGE map as a whole (including all its other individual proteins). 
These kinds of varying results, depending on the identity of the drug being tested, in no way detracts 
from my conclusion that persons skilled in the art on January 9, 1997, having read the Hillman '562 
specification, would specifically request that any 2-D PAGE map that was being used for conducting 
protein expression monitoring studies on drugs for treating various cancers (e.g., a toxicology study or 
any efficacy study of the type that typically takes place in connection with the development of a drug) 
utilize the SEQ ID NO:l polypeptide sequence. Persons skilled in the art on January 9, 1997 would 
have wanted their 2-D PAGE map to utilize the SEQ ID NO:l polypeptide sequence because a 2-D 
PAGE map that utilized protein sequence information the polypeptide (as compared to one that did not) 
would provide more useful results in the kind of protein expression monitoring studies using 2-D PAGE 
maps that persons skilled in the art have been doing since well prior to January 9, 1997. 

The foregoing is not intended to be an all-inclusive explanation of all my reasons for 
reaching the conclusions stated in this paragraph 12, and in paragraph 6, supra. In my view, however, 
it provides more than sufficient reasons to justify my conclusions stated in paragraph 6 of this 
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Declaration regarding the Hillman '562 application disclosing to persons skilled in the art at the time of 
its filing substantial, specific and credible real-world utilities for the SEQ ID NO:l polypeptide. 

13. Also pertinent to my considerations underlying this Declaration is the fact that 
the Hillman '562 disclosure regarding the uses of the SEQ ID NO:l polypeptide for protein expression 
monitoring applications is not limited to the use of that protein in 2-D PAGE maps. For one thing, the 
Hillman '562 disclosure regarding the technique used in gene and protein expression monitoring 
applications is broad (Hillman '562 application at, e.g., p. 25, lines 18-22 and p. 37, lines 1-9). 

In addition, the Hillman '562 specification repeatedly teaches that the protein described 
therein (including the SEQ ID NO:l polypeptide) may desirably be used in any of a number of long 
established "standard" techniques, such as ELISA or western blot analysis, for conducting protein 
expression monitoring studies. See, e.g.: 

(a) Hillman '562 application at p. 26, lines 1-9 ("A variety of protocols for 
detecting and measuring the expression of IMP, using either polyclonal or monoclonal antibodies 
specific for the protein are known in the art. Examples include enzyme-linked immunosorbent assay 
(ELISA), radioimmunoassay (RIA), and fluorescence activated cell sorting (FACS)"); 

(b) Hillman '562 application at p. 37, lines 1-9 ("A variety of protocols 
including ELISA, RIA, and FACS for measuring IMP are known in the art and provide a basis for 
diagnosing altered or abnormal levels of IMP expression. Normal or standard values for IMP 
expression are established by combining body fluids or cell extracts taken from normal mammalian 
subjects, preferably human, with antibody to IMP under conditions suitable for complex formation. 
The amount of standard complex formation may be quantified by various methods, but preferably by 
photometric, means. Quantities of IMP expressed in subject, control and disease, samples from 
biopsied tissues are compared with the standard values. Deviation between standard and subject 
values establishes the parameters for diagnosing disease"). 

Thus a person skilled in the art on January 9, 1997, who read the Hillman '562 
specification, would have routinely and readily appreciated that the SEQ ID NO:l polypeptide 
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disclosed therein would be useful to conduct protein expression monitoring analyses using 2-D PAGE 
mapping or western blot analysis or any of the other traditional membrane-based protein expression 
monitoring techniques that were known and in common use many years prior to the filing of the Hillman 
'562 application. For example, a person skilled in the art in January 1997 would have routinely and 
readily appreciated that the SEQ ID NO:l polypeptide would be a useful tool in conducting protein 
expression analyses, using the 2-D PAGE mapping or western analysis techniques, in furtherance of (a) 
the development of drugs for the treatment of various cancers, and (b) analyses of the efficacy and 
toxicity of such drugs. 

14. I declare further that all statements made herein of my own knowledge are true 
and that all statements made herein on information and belief are believed to be true; and further, that 
these statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, and that willful false statements may jeopardize the validity 
of this application and any patent issuing thereon. 



L. Michael Furness, B.Sc. 

Signed at Exning, United Kingdom 
this day of [Month], 2003 
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A two-dimensional gel database of rat liver proteins 
useful in gene regulation and drug effects studies 

A standard two-dimensional (2-0) protein map of Fischer 344 rat liver 
(F344MST3) is presented, with a tabular listing of more than 1200 protein species. 
Sodium dodecyl sulfate (SDS) molecular mass and isoelectric point have been es- 
tablished, based on positions of numerous internal standards. This map has been 
used to connect and compare hundreds of 2-D gels of rat liver samples from a va- 
riety of studies, and forms the nucleus of an expanding database describing rat 
liver proteins and their regulation by various drugs and toxic agents. An example 
of such a study, involving regulation of cholesterol synthesis by cholesterol-lower- 
ing .drugs and a high-cholesterol diet, is presented. Since the map has been ob- 
tained with a widely used and highly reproducible 2-D gel system (the lso-Dalt* 
system), it can be directly related to an expanding body of work in other laborato- 
ries. 
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1 Introduction . . 

High-resolution two-dimensional electrophoresis of pro- 
teins, introduced in 1975 by OTarrell and others { i— 4j, has 
been used oVer the ensuing 16 years to examine a wide va- 
riety of biological systems, the results appearing in more 
than 5000 published papers. With the advent of computer- 
ized systems for analyzing two-dimensional (2-D) gel ima- 
ges and constructing spot databases, it is also possible to 
plan and assemble integrated bodies of information de- 
scribing the appearance and regulation of thousands of pro- 
tein gene products [5, 6]. Creating such databases involves 
amassing and organizing quantitative data from thousands 
of 2-D gels, and requires a substantial commitment in tech- 
nology and resources. 

Given the long-term effort required to develop a protein da- 
tabase, the choice of a biological system takes on consider- 
able importance. While in \ttro systems are ideal foranswer- 
ing many experimental questions, especially in cancer re- 
search and genetics, our experience with cell cultures and 
tissue samples suggests that some tn vivo approaches could 
have major advantages. In particular, we have noticed that 
liver tissue samples from rats and mice appear to show grea- 
ter quantitative reproducibility (in terms of individual pr - 
tein expression) than replicate cell cultures. This is perhaps 
a natural result of the homeostasis maintained in a com- 
plete animal vs. the well-known variability of cell cultures, 
the latter due principally to differences in reagents {e.g.. 
fetal bovine serum ). conditions i tv,'.. pH ) and genetic "evo- 
lution'of cell lines while in culture. It is also more difficult 
to generate adequate amounts of protein from cell culture 
systems (particularly with attached cells), forcing the inves-. 
tigator to resort to radioisotope-based or silver-based stain- 
detection methods. While these methods are more sensi- 
tive (sometimes much more sensitive) than the Coomassie 
Brilliant Blue (CBB) stain typically used for protein detec- 
tion in "large* protein samples, they are generally more vari- 
able, more labor-iniensive and. in the case of radiographic 
methods, may generate highly "noisy" images, due t the 
properties of the films used. By contrast, large protein sam- 
ples can easily be prepared from liver using urea/Nonidet 
P-40 (NP-40) solubilization and stained with CBB, which 
has the advantage of being easily reproducible (8). Finally, 
there remains the question of the "iruthfulness^of many in 
vitro systems as compared to their in vivo analogs; how 
great are the changes caused by the intr duction into a cul- 
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ture and the associated shift to strong selection for growth, 
and h w do these a/Tea experimental outcomes? Hence 
the apparent advantages of in vitro systems, in terms of ex- 
perimental manipulation, may be counterbalanced by 
other factors relating to 2-D data quality. 

There is a second important class of reasons for exploring 
the use of an in vivo biological system such as the liver. His* 
toricaJly, there have been two broad approaches to the me- 
chanistic dissection of biochemical processes in intact cel- 
lular systems: genetics (a search for informative mutants) 
and the use of chemical agents (drugs and chemical toxins). 
Both approaches help us to understand complex systems 
by disrupting some specific functional element and show- 
ing us the result. With the development of techniques for 
genetic manipulation and cloning, the genetic approach 
can be effectively applied either in vitro or in vivo, although 
the in vitro route is usually quicker. The chemical approach 
can also be applied to either sort of biological system; here, 
however, the bulk of consistently acquired information is 
in experimental animals (rats and mice). While most biolo- 
gists know a short list of compounds having specific, experi- 
mentally useful effects (e.g.. inhibitors of protein synthesis, 
ionophores, polymerase inhibitors, channel blockers, nu- 
cleotide analogs. and compounds aflecting polymerization 
of cytoskeletal proteins), there is a much larger number of 
interesting chemically-induced effects, most of them char- 
acterized by toxicologisis and pharmacologists in rodent 
systems. Just as a thorough genetic analysis would involve 
saturating a genome with mutations, it is possible to ima- 
gine a saturating number of drugs, the analysts of whose ac- 
tions would reveal the complete biochemistry of the cell. 
While organized drug discovery efforts usually target spe- 
cific desired effects, the nature of the process, w ith its de- 
pendence on screening large numbers of compounds, ne- 
cessarily produces many unanticipated effects. It is there- 
fore reasonable to suppose that the required broad range of 
compounds necessary to achieve "biochemical saturation" 
may be forthcoming: in fact, it may already exist among the 
hundreds of thousands of compounds that failed to qualify 
as drugs. 

Among organs, the liver is an obvious choice for the study 
of chemical effects because of its well-known plasticity and 
responsiveness. The brain appears to be quite plastic (e.g. 
[7]), but it is a complicated mixture of cell types requiring 
skillful dissection for most experiments. The kidney, while 
quite responsive, also presents a potentially confounding 
mixture of cell types. The liver, by contrast, is made up of 
one predominant cell type which is easy to solubilize: the 
hepatocyte, representing more than 95% of its mass. Most 
importantly, the liver performs many homeostatic func- 
tions that require rapid modulation of gene expression. It 
appears that most chemical agents tested -affect gene ex- 
pression in the liver at some dosage (N. Leigh Anderson, 
unpublished observations), an interesting contrast to our 
earlier work with lymphocytes, for example, which seem to 
be much less responsive.Such results conform to the expec- 
tation that cells with a homeostatic, physiological role 
should be more plastic than cells differentiated for a pur- 
pose dependent on the action of a limited number of spe- 
cific genes. 

The liver also allows the parallels between in vitro and in 
vivo systems to be examined in detail. Significant progress 



has been made in the development of mouse rai a r* 
man hepatocyte culture systems, as w ell as in precisio hw 
tissue slices. Using such an array of techniques, it » * 
ble to assemble a matrix of mammalian systems inciu °* 
mouse and rat in vivo on one level and mouse, rat and** 
man in vitro on a second level, and to compare efTecw 
tween species and between systems. This approach alio^ 
us to draw informed conclusions regarding the biochen- 
"universality" of biological responses among the manir T 
and to offer some insight into the validity of in vitrei 
proaches for toxicological screening. We believe this d?* 
will be necessary If in vitro alternatives are to achieve u*^ 
usage in government-mandated safetytesting of drugs 
sumer products and industrial and agricultural chemical? 

A number of interesting studies have been published usm- 
2-D mapping to examine effects in the rodent liver. A nu- 
ber of investigarors have made use of the technique 
screen for existing genetic variants [8-11] or induced nur«- 
lions [12-14], mainly in the mouse. This work builds on trT; 
wealth of genetic information available on the mouse ani 
its established position as a mammalian mutation-deteJ. 
lion system. While some studies of chemical effects haV? 
been undertakerrin the mouse [ 15-17]. most have used th; 
rat [18-23]. The examination of the cytochrome p-450s>> 
tem, in particular, has been carried out almost exclusive!) 
on the rat [24. 25]. 

These considerations lead us to conclude that rodent live: 
ofTers the best opportunity to systematically examine ar. 
arcay of gene regulation systems, and ultimately to build a 
predictive model of large-scale mammalian gene control. 
The basic underlying foundation of such a project is a reli- 
able, reproducible master 2-D pattern of liver, to which on- 
going experimental results can be referred. In this paper, we 
report such a master pattern for the acidic and neutral pro- 
teins of rai liver(pattern F344MST3).ln future. this master 
will be supplemented by mapsof basic proteins, and analog- 
ous maps of mouse and human liver. 



2 Materials and methods 
2.1 Sample preparation 

Liver is an ideal sample material for most biochemical stud- 
ies, including 2-D analysis. A sample is taken of approxima- 
tely 0.5 g of tissue from the apical end of the left lobe of the 
liver. Solubilization is effected as rapidly as practical; a 
delay of 5-15 min appears to cause no major alteration in 
liver protein composition if the liver pieces are kept cold 
(e.g., on ice) in the interim. In the solubilization process, 
the liver sample is weighed, placed in a glass homogenucr 
(e.g., 15 mLWheaton); ^volumes of solubilizing solution 



• The solubilizing solution is composed of 2% NP-40 (Sigma).* * ' 
(analytical grade, rr. BDH or Bio-Rad). 0.5% dilhiolhreitol (DT^- 
Sigma) and 2% carrier ampholytes (pH 9-11 LKB: these comeiS J jLj 
stock solution, so 2 % final concentration is achieved by making the 
solution 10% 9-1 1 Ampholine by volume). A large batch of *° lul "* 
(several hundred mL) is made and stored frozen at -80°C in • 
sufficient to provide enough for one day's estimated sample pf* P— ^ 



tion requirement. The solution is never allowed to become W1 



conu» r 



than room temperature at any stage during preparation or 
use, since heating of concentrated urea solutions can produce 
nants that covalently modify proteins producing artifactual 
shifts. Once thawed, any unused solubilizer is discarded. 
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I (i.e., 4 mL per 0-5 g tissue) and the mixture is ho- 
ed using first the loose- and then then the tight-fit- 
g^jass pestle. This takes approximately 5 strokes with 
£ pestle and is carried out at room temperature because 
£ would crystallize out id the cold. Once the liver sample 
thoroughly homogenized in the solubilizer. it is assumed 
il til the proteins are denatured (by the chaotropic effect 
the urea and NP-40 detergent) and the enzymes inacti- 
jed by the high pH (-9J). Therefore these samples may 
itept at r om temperature until they can be centrifuged 
frozen as a group (within several hours of preparation). 
1C samples are centrifuged for 6 X 1 0* g min [e.g.. 500 000 
f for 12 min using a Beckman TL-100 centrifuge). The 
aitrifuge rotor is maintained at just below room tempera- 
te (*•*•« 15— 20°C), but not too cold, so as to prevent the 
xcipitation of urea. The centrifuge of choice is a Beckman 
H00 because of the sample tube sizes available, but any 
Itracentrifuge accepting smallish tubes will suffice. When 
l appropriate centrifuge is not available near the site of 
unple preparation, samples can be frozen at -80 'C and 
lawed prior to centrifugation and collection of superna- 
uits.Each supernatant is carefully removed following cen- 
ifugation and aliquoted into at least 4 clean tubes for stor- 
je.This is done by transferring all the supernatant to one 
lean tube, mixing this gently (to assure homogeneous 
□imposition) and then dividing it into 4 aliquots. The ali- 
uots are frozen immediately at — 80°C. These multiple ali- 
uots can provide insurance against a failed run or a freezer 
•reakdown. 



12 Two-dimensional electrophoresis 
& 

Sample proteins are resolved by 2-D electrophoresis using 
& 20 X 25 cm Iso-Dalt* 2-D gel system ((26-29); pro- 
ceed by LSB and by Hoefer Scientific Instruments, San 
Francisco) operating with 20 gels per batch. All first-dimen- 
iional isoelectric focusing (IEF) gels are prepared using the 
same single standardized batch of carrier ampholytes 
JDH 4-8 A in the present case, selected by LSB's batch* 
testing program for rat and mouse database work**). A 10 
uLTsample of solubilized liver protein is applied to each gel, 
ujd the gels are run for 33000 to 34500 volt-hours using a 
progressively increasing voltage protocol implemented by 
^programmable high-voltage power supply. AnAnge- 
toque* computer-controlled gradient-casting system (pro- 
duced by LSB) is used to prepare second-dimensional sod- 
ium dodecyl sulfate (SDS) polyacrylamide gradient slab 
£eb in which the top 5 % of the. gel is 1 1 %T acrylamide, and 
the lower 95 % of the gel varies linearly from 1 1 % to 1 8 %T. 

gus system has recently been modified so as to employ a 
commercially available 30.8%T acrylamide/A f ,A p -methyle- 
^jbisacrylamide prepared solution (thus avoiding the han- 
ging of the s lid acrylamide monomer) and three addi- 
tional stock solutions: buffer (made from Sigma pre-set 
Tns), persulfate and A'.MA^AMetramethylethylenedi- 
jmine (TEMED). Each gel is identified by a computer- 
tented filter paper label polymerized into the lower left cor- 
^of the gel. First-dimensional IEF tube gels are loaded 

s material (succeeding certified batches of which are available from 
ioefer Scientific Instruments) has the most linear pH gradient pro- 
duced by any ampholyte tested except for the Pharmacia wide range 
j(*bich has an unacceptable tendency to bind high-molecular weight 
sdic proteins, causing them to streak). 



directly (as extruded) onto the slab gels with ut equilibra- 
tion, and held in place by polyester fabric wedges (Wed- 
gies'*, produced by LSB) to avoid the use of hot agarose. 
Second-dimensional slab gels are run vernight. in groups 
of 20, in cooled DALT tanks (10°C) with buffer circulation. 
All run. parameters, reagent source and lot information, 
and notations or deviation from expected results are ente- 
red by the technician responsible on a detailed, multi-page 
record of the experiment. 

23 Staining 

Following SDS-electrophoresis, slab gels are stained for 
protein using a colloidal Coomassie Blue G-250 procedure 
in covered plastic boxes, with 10 gels (totalling approxima- 
tely 1 L of gel) per box. This procedure (based on the work 
of Neuhoff [30, 31]) involves fixation in 1.5 L of 50% etha- 
nol and 2% phosphoric acid for 2h. three 30 min washes, 
each in 2 L of cold tap water, and transfer to 1.5 L of 34% 
methanol, 17% ammonium sulfate and 2% phosphoric acid 
for 1 h. followed by the addition of a gram of powdered Coo- 
massie Blue G-250 stain. Staining requires approximately 4 
days to reach equilibrium intensity, whereupon geis are 
transferred to cool tap water an<J their surfaces rinsed to re- 
move any particulate stain prior to scanning. Gels may be 
kept for several months in water with added sodium azide. 
The water washes remove ethanol that would dissolve the 
stain (and render the system noncolloidal. with high back- 
grounds). The concentrated ammonium sulfate and meth- 
anol solution is diluted by equilibration with the water vol- 
ume of the gels to automatically achieve the correct final 
concentrations for colloidal staining. Practical advantages 
of this staining approach can be summarized as follows: (i) 
the low, flat background makes computer evaluation of 
small spots (max OD < 0.02) possible, especially when 
using laser densitometry; (ii) up to 1500 spots can be reli- 
ably detected on many gels (e.g.. rat liver) at loadings low 
enough to preserve excellent resolution; and (iii) reprodu- 
cibility appears to be very good: at least several hundred 
spots have coefficients of reproducibility less than 15%. 
This value is at least as good as previous CBB methods, and 
significantly better than many silver stain systems. 

2.4 Positional standardization 

The carbamylated rabbit muscle creatine phosphokinase 
(CPK) standards [32] are purchased from Pharmacia and 
BDH. Amino acid compositions, and numbers of residues 
present in proteins used for internal standardization, are 
taken from the Protein Identification Resource (PIR) se- 
quence database [33]. 



1JS Computer analysts 

Stained slab gels are digitized in red light at 134 micron re- 
solution, using either a Molecular Dynamics laser scanner 
(with pixel sampling) or an Eikonix 78/99 CCD scanner. 
Raw digitized gel images are archived on high-density DAT 
tape (or equivalent storage media) and a greyscale video- 
print prepared from the raw digital image as hard-copy 
backup f the gel image. Gels are pr cessed using the Kep- 
ler* software system (produced by LSB), a commercially 
available workstation-based software package built on 
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some of the principles of the earlier TYCHO system [34- 
41]. Procedure PROC008 is used to yield a spotlist giving 
position* shape and density information for each detected 
spot. This procedure makes use of digital filtering, mathe- 
matical morphology techniques and digital masking to re- 
move the background, and uses full 2-D least-squares opti- 
mization to refine the parameters of a 2-D Gaussian shape 
for each spot. Processing parameters and file locations are 
stored in a relational database, while various log files detail- 
ing operation of the automatic analysis software are ar- 
chived with the reduced data. The computed resolution and 
level of Gaussian convergence of each gel are inspected 
and archived for quality control purposes. 

Experiment packages are constructed using the Kepler ex- 
periment definition database to assemble groups of 2-D 
patterns corresponding to the experimental groups (e.g.. 
treated and control animals). Each 2-D pattern is matched 
to the appropriate "master** 2-D pattern (pattern 
F344MST3 in the case of Fischer 344 rat liver), thereby 
providing linkage to the existing rodent protein 2-D data- 
bases. The software allows experiments containing hun- 
dr ds of gels to be constructed and analyzed as a unit, with 
up t iOO gels displayed on the screen at one time for com- 
parative purposes and multiple pages to accommodate ex- 
periments of > 1000 gels. For each treatment, proteins 
showing significant quantitative differences vs. appropriate 
controls are selected using group-wise statistical parame- 
ters (e.g.. Student's t-test, Kepler* procedure STUDENT). 
Proteins satisfying various quantitative criteria (such as P< 
0.001 difference from appropriate controls) are repre- 
sented as highlighted spots onscreen or on computer-plot- 
ted protein maps and stored as spot populations (i.e., logi- 
cal vectors) in a liver protein database. Quantitative data 
(spot parameters, statistical or other computed values) are 
stored as real-valued vectors in the database. Analysis of co- 
regulation is performed using a Pierson product-moment 
correlation (Kepler procedure CORREL) to determine 
whether groups of proteins are coordinately regulated by 
any of the treatments. Such groups can be presented graphi- 
cally on a protein map,and reported together with the statis- 
tical criteria used to assess the level of coregulation. Multi- 
variate statistical analysis (e.g., principal components* ana- 
lysis) is performed on data exported to S AS (SAS Institute). 

2.6 Graphical data output 

Graphical results are prepared in GKS and translated 
within Kepler* into output for any of a variety of devices. 
Linedrawing output is typically prepared as Postscript and 
printed on an Apple LaserWriter. Detailed maps presented 
here have been generated using an ultra-high-resolution 
P stscript-corapatible Linotronic output device. Greyscale 
graphics are reproduced from the workstation screen using 
a Seikosha videoprinter. Patterns are shown in the standard 
orientation, with high molecular mass at the top and acidic 
proteins to the left. 

2.7 Experiment LSBC04 

In the study described here 12-week-old Charles River 
male F344 rats were used. Diets were prepared at LSB, 
based on a Purina 5755 M Basal Purified Diet. Lovastatin 
and cholestyramine were obtained as prescription pharma- 



ceuticals, ground and mixed with the diet at r nn . 
of 0.075% and 1 %, respectively. The high iaS^ 



was Purina 5801M-A (5Vchoi^ 
late in the control diet). Animal work was carried omT 
crobiological Associates (Bethesda, MD). Animals w " 
climatized for one week on the control diet, fed test <f 
trol diets for one week, and sacrificed on day 8 mI^' 
daily doses of lovastatin and cholestyramine in aporon 
groups were 37 mg/kg/day and 5 g/kg/day. respect^ 
based on the weight of the food consumed. Liver sam i 
were collected and prepared for 2-D electrophoresis aer 
ing to the standard liver protocol (homogenizatirJ , 
volumes of 9 m urea, 2% NP-40. 0.5% dithioihreXi "L 
LKB pH'9-1 1 carrier ampholytes, followedTSSf " 
lion for 30 min at 80000 X g). Kidney, brain'and 
samples were frozen. Gels were run as described ahn 
and the data was analyzed using the Kepler* system f i 
were scaled, to remove the effect of differences in proiem 
loading, by setting the summed abundances of a large num 
ber of matched spots equal for each gel (linear scaling) * 



3 Results and discussion 

> 

3.1 The rat liver protein 2-D map 

F344MST3 is a standard 2-D pattern of rat liver proteins 
based on the Fischer 344 strain. This pattern was initiated 
from a single 2-D gel and extensively edited in an experi- 
ment comparing it to a range of protein loads, so as to in- 
clude both small spots and well-resolved representations of 
high-abundance spots. More than 700 rat liver 2-D patterns 
have been matched to F344MST3 in a series of drug effects 
and protein characterization experiments, and numerous 
new spots (induced by specific drugs, for instance) have 
been added as a result. A modified version including addi- 
tional spots present in the Sprague-Dawley outbred rat has 
also been developed (data not shown). Figure 1 shows a 
greyscale representation and Fig. 2 a schematic plot of the 
master pattern. More than 1200 spots are included, most of 
which are visible on typical gels loaded with 10 uLof solubi- 
lized liver protein prepared by the standard method and 
stained with colloidal Coomassie Blue. Master spot num- 
bers (MSN's) have been assigned to all proteins, and ap- 
pear in the following figures, each showing one quadrant of 
the pattern. Figure 3 shows the upper left (acidic, high 
molecular mass) quadrant, Fig. 4 the upper right (basic, 
high molecular mass) quadrant, Fig. 5 the lower left (acidic, 
low molecular mass) quadrant, and Fig. 6 the lower right 
(basic, low molecular mass) quadrant. The quadrants over- 
lap as an aid to moving between them. The gel position (in 
100 micron units), isoelectric point (relative to the CPK in- 
ternal p/standards) and SDS molecular mass (from the cali- 
bration curve in Fig. 8) are listed for each spot (Table l).B e * 
cause of the precision of the CPK-p/ values, these parame- 
ters can be used to relate spot locations between gel sys- 
tems more reliably than using pi measurements expressed 
as pH. A major objective of current studies is the identifica- 
tion of all major spots corresponding to known liver pK>* 
teins, as well as rigorous definitions of subcellular orga- 
nelle contents. Of particular interest to us is the parallel de- 
velopment of identifications in the rat and mouse li yCf 
maps, allowing detailed comparisons of gene expression ef- 
fects in the two systems. The results f these studies will be 
presented systematically in a later edition of this database 
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r j include here a useful series of 22 orienting identifi- 
tJons as an aid to other users of the rat liver pattern (Table 



fCarbamylaied charge sundards, competed pf s and 
molecular mass standardization 

febave previously shown that the use of a system of close- 
spaced internal pi markers (made by carbamylating a 
jsic protein) ofTers an accurate and workable solution to 
!e problem of assigning positions in the p/dimension [32]. 
be same system, based on 36 protein species made by car- 
amylating rabbit muscle CPK- has been used here to as- 
ign pfs to most rat liver acidic and neutral proteins. The 
[andards were coelectrophoresed with total liver proteins, 
n d the standard spots added to a special version of the 
laster pattern F344MST3. The gel A-coordinates of all 
; ver protein spots lying within the CPK charge train were 
ben transformed into CPK pi positions by interpolation 
letween the positions of immediately adjacent standards 
Table 1) using a Kepler* vector procedure. 

t has proven possible to compute fairly accurate p! values 
of many proteins from the amino acid composition [42]. 
Wt have attempted here to test a further elaboration of this 
ipproach. in which we computed pf s forthe CPK standards 
Jiemselves, based on our knowledge of the rabbit muscle 
CPK sequence and the fact that adjacent members of the 
iarge train typically differ by blockage of one additional ly- 
one residue (Table 3). We compared these values to similar 
computed p/s for an additional set of carbamylated stand- 
ards made from human hemoglobin beta chains and a se- 
ries of rat liver and human plasma proteins of known posi- 
tion and sequence (Fig. T.Table 4).The result demonstrates 
good concordance between these systems. Two proteins 
show significant deviations: liver fatty-acid binding protein 
(FABP; #1 in Table 4) and protein disulphide isomerase 
(#20 in the table). The FABP spot present on F344MST3 
may represent a charge-modified version of a more basic 
parent spot closer to the expected p/, not resolved in the 
DEF/SDS gel. Of particular importance is the fact that, by 
comparing computed pfs of sequenced but unlocated pro- 
teins with the CPK pfs, we can assign a probable gel loca- 
$od without making any assumptions regarding the actual 
gel pH gradient. This ofTers a useful shortcut, given the va- 
garies of pH measurement on small diameter IEF gels. We 
||ye used this approach to compute the CPK pf s of all rat 
Sdmouse proteins in the PIRsequence database,as an aid 
protein identification (data not shown). 

border to standardize SDS molecular weight (SDS-MW), 
j have used a standard curve fitted to a series of identified 
jgpteins (Fig. 8). Rather than using molecular mass per se y 
*e have elected to use the number of amino acids in the 
{^peptide chain, as perhaps a better indication of the 
* jgth of the SDS-coated rod that is sieved by the second 
tension slab. The resulting values were multiplied by 
(the weighted average mass of amino acids in se- 
Benced proteins) to give predicted molecular masses. Be- 
use we use gradient slabs, we have n t constrained the fn- 
Vcurve t conform to any predetermined model; rather 
^tried many equations and selected the best using the 
Ingram Tablecurve" on a PC. The equation ch sen was y 
J+ bx +C/X 2 , where vis the number f residues,* is the gel 



Y coordinate, a is 5 1 1 .83, b is -0 271 1 and c is 33 1 83 80 1 . The 
resulting fit appears to be fairly good over a broad range of 
molecular mass. 

3 -3 Ad example of rat liver gene regulation: Cholesterol 
metabolism 

Experiment LSBC04 was designed as a small-scale test of 
the regulation of cholesterol metabolism in vivo by three 
agents included in the diet: lovastatin (Mevacor*,an inhibi- 
tor of HMG-CoA reductase); cholestyramine (a bile acid 
sequestrant that has the effect of removing cholesterol 
from the gut-liver recirculation); and cholesterol itself. The 
first two agents should lower available cholesterol and the 
third should raise it, allowing manipulation of relevant 
gene expression control systems in both directions. Such 
an experiment offers an interesting test of the 2-D mapping 
system since most of the pathway enzymes are present in 
low abundance, many are membrane-bound and difficult 
to solubilize, and the pathway itself is complex. Approxima- 
tely 1000 proteins were separated and detected in liver ho- 
mogenates.Twenty-one proteins were found to be affected 
by at least one treatment, andHhese could be divided into 
several coregulated groups. 

33.1 MSN 413 (putative cytosolic HMG-CoA synthase) 
and sets of spots regulated coordinate!} or inversely 

One group of spots (including a spot assigned to the cyto- 
solic HMG-CoA synthase, MSN 413) showed the expected 
increase in abundance with lovastatin or cholestyramine, 
the synergistic further increase with lovastatin and choles- 
tyramine, and a dramatic decrease with the high cholesterol 
diet. Spot number 413 is the most strongly regulated pro- 
tein in the present experiment, showing a 5- to 10-fold in- 
duction after a 1 week treatment with 0.075 % lovastatin and 
1% cholestyramine in the diet (Figs. 9 and 10). Its expres- 
sion follows precisely the expectation for an enzyme whose 
abundance is controlled by the cholesterol level; it is pro- 
gressively increased from the control levels by cholestyra- 
mine, lovastatin and lovastatin plus cholestyramine, and it 
sinks below the threshold of detection in animals fed the 
high cholesterol diet. This spot has been tentatively identi- 
fied as the cytosolic HMG-CoA synthase, based on a reac- 
tion with an antiserum to that protein provided by Dr. Mi- 
chael Greenspan at Merck Sharp & Dohme Research Labo- 
ratories. This enzyme lies immediately before HMG-CoA 
reductase in the liver cholesterol biosynthesis pathway, and 
is known to be co-regulated with it. Spot 413 has an SDS 
molecular weight of about 54 000 and a CPK pi of -1 1.4, in 
reasonably close agreement with a molecular weight of 
57300 and a CPK p/ of -15.7 computed from the known se- 
quence of the hamster enzyme (43). 

Using a classical product-moment correlation test (Kepler 
procedure CORREL), a series of five additional spots was 
found to be coregulated with 413. The level of correlation 
was exceedingly high (> 95%). Two of these, 1250 and 933, 
are at similar molecular weights and approximately one 
charge more acidic than 413 (Fig. 9), indicating that they 
may be covalently modified forms of the 413 polypeptide. 
This suspicion is strengthened by the bservation that both 
spots are also stained by the antibody t cytosolic HMG- 
CoA synthase. The remaining three correlated spots appear 
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to comprise an additional related pair (1253 and 1001) of 
around 40 kDa and a single spot (1119) of ar und 28 kDa 
Because these two presumed proteins are present at sub- 
stannally lower abundances than 4 13. and because the cyto- 
solic HMG-CoA synthase is reported to consist of nlvone 
type of polypeptide, ihey are likely to represent the/very 
tightly coregulated enzymes. A second group of six spots 
was selected based on a regulatory pattern close to the in- 
verse of that forspot 413 (MSN-$34,79. 178, 182,204,347 
data not shown). For these proteins, the lowest level of ex- 
pression occurs with exposure to lovastatin plus cholestyra- 
mine and the highest level upon exposure to the high-cho- 
lesterol diet. Spots 182 and 79 are highlv correlated and lie 
about one charge apart at the same molecular weight; they 
may thus be isoforras of a single protein. The other four 
spots probably represent additional enzymes or subunits. 

33 2 MSN 235 and coregulated spots 

A third group of five spots, mainly comprised of mitochon- 
drial proteins including putative mitochondrial HMG- 
CoA synthase spots, showed a modest induction hv 
tin aione, but little or no effect with any of the other treat- 
ments (including the combination of lovastatin and choles- 
tyramine; Fig. 12). This result is intriguing because lovasta- 
tin was expected to a/Tect only the regulation of enzvmes of 
cholesterol synthesis, which is entirelv extra-mhochon- 
dnal. Three of the spots (235, 134, 144) form a closely- 
packed triad at approximately 30 kDa, and are likely to re- 
present isofonns of one protein. All three spots are stained 
by an antibody to the mitochondrial form of HMG-CoA 
synthase obtained from Dr. Greenspan. Subcellular fractio- 
nation indicates a mitochondrial location. The other two 
spots (633 at about 38 kDa and 724 at about 69 kDa) are 
each present at lower abundance than the members of the 
tnad. 

3 33 An example of an anti-synergistic effect 

A sixth spot (367) shows strong induction by lovastatin 
two- to threefold), and about half as much induction with 
lovastatin plus cholestyramine, but without sharing the ani- 
mal-animal heterogeneity pattern of the 235-set (Fig 13) 
This protein is also mitochondrial, and represents the clear- 
est example of an anti-synergistic effect of lovastatin and 
cholestyramine. The existence of such an effect demon- 
strat s that lovastatin and cholestyramine do not act exclu- 
sively through the same regulatory pathway. 

33.4 Complexity of the cholesterol synthesis pathway 

Taken together, these results suggest that treatment with lo- 
vastatin alone can affect both cytosolic and mitochondrial 
pathways using HMG-CoA, while cholestyramine, on the 
other hand, either alone or in combination with lovastatin 
produces a strong efTect on the putative cytosolic pathway! 
but little or no effect on the putative mitochondrial path- 
way. An explanation for this difference may lie in I vasta- 
tin s efTect on levels of HMG-CoA and related precursor 
compounds that are exchanged between the cytosol and 
the mitochondrion, whereas cholestyramine should affect 
only the cytosolic pathways directly controlled by cholester- 
ol and bile acid levels. It remains to be explained why some 



proteins of the putative mitochondrial pathwav 
much more variable in theirexpression in all gro unt V e *° 
animation of all the coregulated groups sug J«, S T An e »- 
titative statistical techniques can extract a w ea i,h „ r • 
esting information from large sets of reproducible ^ 
abundance of spots in the 413 coregulation grouo for "* 
ple.shows an amazing level of cone rdance in inch ,m " 
expression among the five individuals of the lovasur * 
cholestyramine treatment group. This effect is no? J? ,Bfi 
differences in total protein loading, since thevhave «i« 1° 
been removed by scaling, and since proteins with Q lw> 
ferent regulation patterns can be demonstrated ( e , f 
13).Such effects raise the possibility that manvgene . 
lation sets may be revealed through the studv of 
ciently large population of control animals (/', u 
any experimental manipulation). This approach extC 
natural biological variation in protein expression insteaX 
drug effects, offers an important incentive for the constfu 
tion of a large library of control animal patterns. 



4 Conclusions 

> 

Because of the widespread use of rat liver in both basic bio- 
chemistry and in toxicology, there is a long-term need for a 
comprehensive database of liver proteins. The rat liver mas- 
ter pattern presented here has proven to be an accurate re- 
presentation of this system, having been matched to more 
than 700 gels to date. As the number of proteins identified 
and the number of compounds tested for gene expression 
effects grows, we expect this database to contribute valu- 
able insights into gene regulation. Its practical utility in sev- 
eral areas of mechanistic toxicology is alreadv being de- 
monstrated. 

Received September 11. 1991 
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Schematic reprcsenutioo of the master pattern (the same as Fig. 1 ), useful as an aid in relating specific areas of Fig. 1 and the following detailed 
trains. 
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lyre 4. Upper right (high molecular weight, basic) quadrant (#2) of the rat liver map, showing spot numbers. 
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Figure S. tower left (low molecular weight, acidic) quadrant (#3) of ihc rat liver map. showing spot numbers. 
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<w 6. Lower right (low molecula/ weight, basic) quadrant (#4) of the rat liver map, showing spot numbers. 
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Figures. Plot of number of amino acids 



Gel Y Coordinate 



curve used to predict molecular mass of unidentified 



versus gel J'-posmon.u,^ r ... 



proteins 



-2D -to 

CPK position 



< Figure 7. (a) Plot of computed isoelecuic point v C rsus gd Jl-p 0 $iuon fo- 
two sets of carbamylated standard proteins (rabbit muscle CPK M in- 
human hemoglobin $ chain, filled diamonds) and several other protein! 
(shaded squares). <b) The identities of the various proteins representee 
by the squares are indicated by the numbers in corresponding positions 
on (a); these refer to Table 4. 
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Figure 9. Montage showing efTects in the 
region of MSN:413.The monuge show* 
smaJI window into one portion of the 3-D 
pattern, one row of windows for each ctff 
rimental group, and one panel for each |*J 
in the experiment. The left-most pitteft 
in each row is a group-specific copyo^ 1 ** 
master pattern followed by the pattf" 15 
for the five individual rats in the group- 
The highlighted protein spots (filled ciff* 
les) are spot A 13 (on the nghi of each p** 
el; identified as cytosolic HMG-CoAS5» 
thase) and two modified forms of it 
and 933). From the top. the rows (erpe"" 
menial groups) are: high cholesterol. 
trols. cholestyramine, lovastatin. and to**" 
statin plus cholestyramine. 
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Figure 10. Bargraph showing the quantita- 
tive effects of various treatments on the 
abundance of MSN:413 (cytosolic HMG- 
CoA synthase) in the gels of Fig. 9. 




Figure //. Bargraphs of a series of six core- 
gutated spots including MSN;4I3. In the 
bargraphs. the abundances of the appro- 
priate spot (muter spot number shown at 
the top of the panel) in each animal are 
shown. The five five-animal groups are in 
the order {left to right): high cholesterol, 
controls, cholestyramine, lovastatin, and 
lovasutin plus cholestyramine. Each bar 
within a group represents one experimen- 
tal animal liver (one 2-D gel). Note the cor- 
related expression of the 6 spots, espe- 
cially in the two far right (most strongly in- 
duced) groups. 
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Ftfvre i:. Data on a second co regular 
proupofspois. presence as in Fit 11 T-. 
founh experimental group (iovasia::.-. 
shows a modesi inducuon. while the fifr 
group (iovasiaiin plus cholestyramine- 
does not. 
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Figure 13. Data on spot M5N:367. presented is in Fig. U - T"'* 



rigvrc i j. u*lm on spot Mjn;jo/, presented a$ in rig. u- .•»■• r 
shows unambiguously the anti-synergistic effect of lovasutin and to-— 
tyramine (fifth group) as compared to lovasutin (fourth group)!** 
ponse contrasts strongly with ihe regulation pattern seen in Fig- lh 



feodum 2: Tables 1—4 
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Y CP** SOSMW 



Y CPW SOSMW 



15 
17 
16 
19 

20 

n 

22 
23 
24 
25 



311 
566 
612 
S49 
845 
629 
906 
755 
649 
1204 
332 
787 
313 
607 



27 1184 

28 1263 



29 
30 
32 
33 
34 
35 
36 
36 
35 



743 
768 
1216 
1145 
1037 
863 
712 
763 
304 



41 1165 

42 684 

43 1318 

44 1924 

46 1203 

47 1391 



48 
40 

.50 



309 
605 
621 



51 1113 
L 52 1820 



53 
54 

55 
56 



725 
2001 
722 
678 



57 1682 

56 1091 

59 1171 

60 1400 



61 
62 
65 



1853 
1888 
735 

66 1263 

67 12S2 

68 779 

69 1064 

71 656 

72 638 

73 1582 

74 1570 



75 
76 
77 
78 
79 
80 



1264 
1336 
1833 
1767 
925 
534 



81 1811 

-82 1412 

83 1471 

64 1662 

65 1596 

66 1817 



87 
88 
89 
90 
91 
92 

93 1338 

94 1708 



516 
1589 
1706 

651 
1415 
1773 



434 

263 
426 

266 
520 
569 
414 
296 
403 
448 
434 
424 
417 
516 
S24 
446 
605 
112 
417 
445 
S55 
412 
06 
694 
470 
569 
607 
569 
362 
566 
447 
454 
587 
535 
522 
499 
177 
500 
830 
533 
302 
580 
565 
624 
508 
567 
297 
312 
407 
692 
296 
589 
545 
583 
556 
621 
564 
363 
565 
738 
698 
363 
681 
347 
563 
479 
301 
1371 
698 
719 
329 
710 
545 
446 
696 



«45.0 
-24.3 
-16.0 
-25.2 
•15.3 
-21.6 
-14.0 
-17.5 
-20.9 
4.7 
<45.0 
•16.6 
<-35.0 
-16.1 
-9.0 
-6.0 
-17.8 
-17.2 
-8.6 
4.5 
-11.3 
-14.9 
-16.7 
-17 J 
<-35.0 
■9.2 
-19.6 
-7.3 
-01 
-6.7 
-6.3 
<-35.0 
-22.5 
-21.8 
-10.0 
4.9 
-18.3 
>0.0 
-184 
-19.6 
-2.5 
-10.3 
4.2 
-6.2 
-0.6 
-0.4 
-18.1 
-8.0 
-8.1 
-16.8 
-10.8 
•20.6 
•21.2 
-3.6 
-3.8 
-6.0 
-7.0 
-0.8 
-1.5 
-13.6 
-26.1 
-1.0 
-6.0 
-5.0 
-2.7 
-34 
-0.9 
-27.0 
-3.5 
-2.2 
-20.6 
-6.0 
-1.4 
-7.0 
-2.2 



63.800 
102.900 
64.800 
101.000 
55.200 
50.000 
66.300 
90.200 
67.900 
62.100 
63.800 
65.000 
66.000 
55.500 
54.900 
62.400 
49.000 
348.600 
66.000 
62.500 
52.400 
66.600 
48.900 
43.800 
50.800 
51.400 
48.800 
50.000 
74,600 
50.200 
62.300 
61.500 
50.100 
53.900 
55.000 
57.000 
170,800 
56.900 
37.300 
54.100 
89.000 
50.60C 
50.300 
47.800 
56.200 
51.500 
90.500 
85.900 
67.300 
43.900 
90.80C 
50.000 
53.100 
50.400 
52.300 
48.000 
51.800 
74.400 
51.700 
41.600 
43.600 
74.500 
44.500 
77.500 
51.800 
58.900 
89.100 
17.400 
43.600 
42.500 
81.700 
43.000 
53.200 
62.300 
43.700 



96 1119 

96 1731 

97 1033 
96 1406 
69 578 

100 2004 

101 1106 



102 
103 
104 
106 
106 
107 



482 
665 

773 
312 
1769 
1585 



108 1692 

109 1482 



110 
111 



778 
1728 



113 1191 



114 

115 
116 



1296 
682 
1146 



117 1548 

118 1060 

120 1530 

121 838 

122 1572 

123 23 



124 

125 
126 
127 
128 



621 
1296 

872 
1000 
1229 



129 1422 

130 1776 



131 
132 
133 
134 



1930 
660 
666 

1271 



135 1161 

136 453 

137 1858 

138 1504 

139 1488 

140 1689 



141 
142 



311 
1366 



143 1429 

144 615 

145 2006 

146 2006 

147 1070 

148 1347 



149 

150 
151 



541 
1645 
1269 



152 1507 

153 1722 

154 932 

155 1031 

156 1970 

157 1258 

158 1275 

159 1663 

160 1034 

161 1953 

162 1020 
164 1566 

166 1905 

167 1340 

168 1506 

169 1338 



170 
171 
172 
173 



1969 
800 
476 
919 



536 
756 
566 
565 
1149 
538 
623 
455 
830 
1182 
1117 
509 
720 
807 
593 
516 
700 
680 
185 
907 
610 
849 
577 
828 
423 
712 
1433 
1474 
662 
921 
717 
311 
832 
499 
757 
537 
1019 
862 
1389 
1063 
823 
697 
707 
756 
1417 
915 
346 
1017 
566 
518 
1108 
578 
T481 
760 
236 
911 
448 
503 
294 
684 
163 
417 
820 
527 
771 
1482 
806 
565 
181 
583 
678 
S41 
378 
958 
1314 



-9.9 
-2.0 
•11.4 
-6.1 
•23.8 
>0.0 
-10.1 
-28.5 
-20.2 
-17.0 
<-35.0 
-1.5 
-3.6 
-2.4 
-4.8 
-16.9 
-2.0 
-8.9 
-7.5 
•19.6 
-6.5 
-4.1 
-11.1 
-4.3 
-15 4 
-3.8 
«-35.0 
•21.9 
-7.5 
-14.7 
•12.0 
-64 
-5.8 
-1.4 
-0.1 
-20.4 
-20.2 
-7.9 
-9.3 
-29.7 
-06 
-46 
-4.8 
-2.4 
<-35.0 
•6.7 
-5.7 
-22.1 
>0.0 
>0.0 
-10.7 
-6.9 
•257 
-28 
-7.9 
-4.5 
•2.1 
-13.5 
-11.4 
>0.0 
-8.1 
•7.8 
•2.6 
-11.4 
>0.0 
-11.6 
-3.8 
-0.2 
-7.0 
-4.6 
-7.0 
>0.0 
•16.3 
-28.7 
•13.7 



53.800 
40.700 
51.600 
51.700 
25.000 
53.700 
47.900 
61.300 
37.300 
23.800 
26.100 
56.100 
4Z500 
38.30C 
49.700 
55.50C 
43.S0C 
44.500 
160.60C 
34.10C 
48.70C 
36.500 
50.80C 
37.40C 
65.20C 
42.90C 

15.3a 

13.90C 
36.00C 
33.5a 
42.6a 
86, 10C 
37.3a 
57.CXX 
40.7a 
53.8a 
29.7a 
36.0a 

i6.ea 
28.ia 

37.7a 
43.7a 

43,2a 

40.7a 

is.ea 

33.8a 
77,9a 
29.8a 

5i.6a 
55.3a 

26.5a 
50.8a 

13.7a 
40.5a 
n7,oa 
33.9a 

62.100 
56.6a 

9i.4a 

44 400 

162.400 
6S.9O0 
37,8a 

54.6a 

40.000 

13.7a 

38.4a 

51.7a 
i64.9a 

50.4a 
44.7a 
53.500 
71.800 

32.1a 
19.3a 



MSN 



Y CPKerf SOSMW 



174 
17S 
177 
178 
179 
160 
161 
162 
184 
185 
186 
187 
188 
191 
192 
193 
194 
195 
196 
167 
196 
199 

2a 

202 

203 

204 

205 

206 

207 

208 

210 

211 

213 

214 

215 

216 

217 

218 

219 

220 

221 

223 

225 

226 

227 

228 

229 

230 

232 

234 

235 

236 

237 

238 

239 

240 

241 

242 

243 

244 

245 

246 

247 

248 

249 

250 

251 

252 

253 

254 

255 

256 

2S7 

258 



1364 
825 
1582 
1321 
1089 
1866 
411 
804 
1860 
1997 
279 
773 
1536 
1560 
1818 
1469 
1380 
784 
1227 
667 
2006 
1711 
872 
ou. 
73ft. 
786 
1224 
439 
1994 
1895 
240 

17a 

902 
1067 
1340 
15G1 
1585 
1159 
931 
713 
1479 
965 
934 
1812 
821 
1566 
1065 
1577 
1458 
1440 
1692 
618 
920 
952 
1611 
1489 
501 
1820 
1357 
711 
1855 
1189 
551 
1348 
460 
1733 
1974 
808 
874 
753 
995 
1690 
994 
508 
1517 



183 
393 
553 

710 
615 
567 
295 
730 
896 
1017 
1113 
296 
807 
674 
687 
555 
266 
632 
1185 
553 
681 
674 
424 
435 
253 
829 
589 
983 
571 
687 
1418 
499 
517 
684 
668 
495 
755 
393 
572 
177 
911 
927 
716 
1045 
411 
1483 
567 
890 
496 
849 
489 
1004 
1138 
1006 
541 
720 
448 
569 
656 
1182 
621 
474 
459 
604 
448 
451 
788 
392 
553 
848 
450 
679 
1006 



820 



4.7 
-15.7 
46 

-7.2 
-10.4 
4.5 
•32.1 
-16.2 
4.6 
>0.0 
<-35.0 
-17.0 
-4.2 
•3.9 
4.9 
-5.0 
44 
-16.7 
4.4 
-20.1 
>00 
-2.2 
-14.7 
<-35.0 
-18.0 
-16.7 
4.5 
-30.9 
>0.0 
4.3 
<-35.0 
•2.3 
-14.1 
-10.4 
-7.0 
•3.5 
-3.6 
-9.3 
-13.5 
•18.7 
-4.9 
-12.6 
-13.5 
-1.0 
-15.8 
-3.6 
-10.8 
-3.7 
-5.2 
•55 
-2.4 
-22.0 
•13.7 
-13.1 
-3.2 
-4.8 
-27.7 
49 
4.8 
-18.7 
4.6 
4.9 
•25 1 
4.9 
-29.3 
-1.9 
>0.0 
-16 1 
•14.6 
-17.6 
-12.1 
•24 
•12.1 
-274 
-4 4 



162.900 
69.300 
52.600 
43.000 
48.300 
51.600 
91 200 
42.0a 
* 34.5a 
29.8a 
26.3a 
90.8a 
38.4a 
44.9a 

44 ja . 

52.4a 
101.600 
47.3a 
23.7a 
52.6a 

44.5a 

44.9a 

65.0a 
63,7a 

i07,sa 

37.4a 

so.oa 
31.1a 
51.3a 
44.2a 
15.8a 
57.0a 

55.4a 
44.4a 

45.2a 

57.3a 

40.7a 

69.3a 

51.2a 

170.5a 
33.9a 
33.3a 
42.7a 

28.ea 
66.ea 
13.6a 
5i.6a 
34,ea 

57.3a 
36.5a 
57.9a 
30.3a 
25.4a 

30.2a 

53.5a 
42.5a 

62.ia 
5i .4a 
45.ea 

23.8a 

48,oa 

59.3a 

6i. oa 
49.1a 
6iia 
6i.sa 
39.2a 

69.500 
52.5a 
36.5a 

61.9a 

44.6a 

30.2a 

60.4a 

37.8a 



isoelectric point relative to CPK standards, and 



924 L Aatfcnoe ft si. 



tifCirufik, 



MSN 

2S0 

260 

261 

262 

263 

265 

266 

267 

266 

260 

270 

271 

272 

274 

275 

276 

277 

278 

27B 

281 

282 

283 

284 

285 

286 



Y CPKrf SD&AV 



MSN 



Y CPW SOSMW 



1706 
661 
172S 
406 
1063 
1300 
510 
660 
430 
1044 
2010 
657 
605 
1202 
1350 
1670 
686 
061 
670 
1848 
1505 
1313 
1314 
1332 
1277 



200 
201 
202 
293 
294 
295 
296 
297 
299 
300 
301 
302 
303 
304 
305 
306 
307 
308 
309 
310 
311 
312 
313 
314 
315 
316 
320 
321 
322 
323 
324 
325 
326 
327 
328 
330 
331 
332 
333 
334 
335 
336 
338 
339 
340 
341 
343 
344 



1147 
025 
787 
1462 
S31 
860 
1162 
218 
1377 
013 
2012 
702 
494 
403 
1843 
1049 
1608 
1210 
1627 
1524 
1760 
1600 
266 
1002 
1316 
1341 
1104 
1460 
850 
1454 
670 
655 
1521 
1567 
1368 
448 
1606 
1566 
531 
784 
1050 
1593 
1616 
1854 
1265 
581 
1407 
1351 
1613 



061 
1361 
670 
1127 
172 
673 
437 
1038 
061 
606 
853 
422 
968 
712 
500 

1069 
536 
718 
570 

1084 
525 

1147 
620 
408 

652 

570 
511 
1476 
816 



609 
814 
979 
1523 
667 
178 
1280 
1008 
1565 
533 
989 
916 
755 
692 
1028 
1451 
1408 
1365 
1395 
523 
1053 
1459 
603 
1494 
626 
101 
675 
677 
409 
1291 
751 
697 
471 
1156 
■407 
303 
598 
1004 
688 
565 
1047 
265 
540 



•1.1 
-20 4 
•2.0 
-28.0 
-10.9 
-6.3 
-273 
•204 
-31.0 
-11.2 
>0.0 
-15.0 
•14.2 
•7.6 
•6.9 
-2.6 
-19 4 
•13.0 
-14.5 
-07 
•4.6 
-73 
-7.3 
-7.1 
•7.6 
^3 
-6.5 
-13.6 
•16.6 
-5.1 
-26.3 
-14.9 
-6.3 
<-35.0 
-6.5 
-13.9 
>O.0 
-19.0 
-28.1 
-32.6 
-07 
-11.1 
-3.3 
-8.5 
-3.0 
-4.4 
-1.5 
-3.3 
<-35.0 
-0.3 
-7.3 
-7.0 
-10.1 
-4.9 
-15.1 
-5.3 
-20.0 
•20.6 
-4.4 
-3.6 
-6.3 
•30.0 
•3.3 
•3.6 
•26.3 
-167 
-10.9 
-3.5 
-3.2 
-0.6 
•6.0 
•23.6 
-4.7 
-6.8 
-0.0 



31.000 
17.700 
44.600 
25.800 
177.400 
45.000 
63.400 
29.000 
31.000 
48.000 
36.300 
65.200 
31.700 
42.900 
49.900 
27.100 
53.700 
42.600 
51.300 
27.300 
54.800 
25.100 
37.400 
67.200 
46.100 
37.600 
50.700 
55.900 
13.900 
37.800 
62.000 
43.600 
48.700 
38.000 
31.300 
1Z400 
45.300 
169.200 
20.400 
30.100 
10.300 
49.800 
30.000 
33.700 
40.700 
34.700 
20.400 
14.700 
16.100 
17.600 
16.600 
54.000 
28.500 
14.400 
40.100 
13.300 
47.700 
420.500 
44.800 
44.700 
67.000 
20.100 
40.000 
43.700 
50.600 
24.700 
67,300 
66.500 
40.400 
30.300 
34.000 
50.300 
28.700 
102^00 
52.800 



345 
346 
347 
348 
348 
350 
351 
352 
353 
354 
355 
356 
357 
358 
356 
360 
361 
362 
363 
364 
365 
366 
367 
366 
369 
370 
371 

372 

373 

374 

375 

376 

377 

378 

379 

381 

382 

383 

384 

385 

386 

387 

388 

389 

390 

391 

392 

393 

394 

395 

396 

397 
399 
400 
401 
403 



405 

406 

409 

410 

411 

412 

413 

415 

416 

417 

418 

419 

420 

421 

422 

423 

424 

425 



1006 
1095 
625 
361 
110 
521 
912 
1574 
961 
706 
1450 
1374 
474 
796 
764 
1384 
1713 
1161 
914 
412 
741 
878 
1560 
963 
434 
639 
1567 
1875 
1351 
1506 
1823 
254 
1409 
621 
1017 
953 
856 
12S2 
1699 
1042 
1490 
1554 
1193 
1374 
1456 
718 
1799 
1482 
1227 
1530 
1410 
912 
1465 
1473 
1029 
1516 
1495 
1525 
723 
650 
1501 
936 
350 
1033 
737 
1578 
646 
1695 
725 
1289 
1171 
599 
929 
739 
1490 



578 
640 
728 
963 
1343 
1130 
619 
530 
912 
762 
830 
1152 
997 
346 
338 
1066 
769 
859 
1156 
435 
486 
1503 
935 
520 



610 
860 
762 
1059 
715 
532 
417 
583 
494 
595 
598 
674 
258 
1518 
493 
583 
603 
404 
902 
969 
690 
732 
758 
1461 
577 
755 
256 
1063 
450 
1140 
754 
554 
1092 
252 
663 
478 
1057 
1120 
538 
425 
606 
496 
462 
770 
1041 
912 
162 
656 
625 
965 



-11.9 
-10.3 
-21.7 
-35.3 
<-35.0 
-26.7 
•13.9 
-3.7 
-12.9 
-18.9 
•5.3 
-6.5 
-28.7 
-16.3 
-17 J 
-64 
-2.1 
•9.3 
-13.8 
•32.0 
-17.9 
•14.6 
•3.9 
-12.4 
-31 A 
-21^2 
•3.6 
-0.5 
-6.8 
•4.6 
-0.9 
<-35.0 
-6.1 
-21.8 
-11.7 
-13.1 
-15.0 
-6.1 
-2-3 
-11.2 
•4.7 
-4.0 
-8.9 
-6.5 
-5.2 
-18.5 
•1.1 
-4.6 
-64 
-4.3 
-6.0 
-13.9 
-5.0 
-4.9 
-11.5 
-44 
-47 
-4.3 
-18.4 
•20.8 
-4.6 
-13.4 
-35.0 
-11.4 
-18.0 
-3.7 
-21 .0 
-2.3 
-18.3 
-7.7 
-9.1 . 
-22.8 
•13.6 
-17.9 
-4.7 



50.800 
46.800 
42.000 
31.100 
16.300 
25.700 
48.100 
54.300 
33.900 
40.400 
37.300 
24.900 
30.600 
77.800 
79.400 
27.900 
40.100 
36.100 
24.800 
63.700 
56.200 
13.000 
33.000 
55.200 

ss.ooc 

46.700 
36.100 
40,400 
28.300 
42.700 
54.200 
65.900 
50.400 
57.500 
49.600 
49.400 
44.900 

105.300 
12.500 
57.500 
50.400 
49.100 
67.700 
34.300 
31.700 
44.000 
41,900 
40.600 
14,400 
50.800 
40.800 

106.400 
28.100 
61.900 
25.300 

40.800 

52,500 

27.100 

106,000 

45.500 

59.000 

28.300 

26.000 

53.700 

64,900 

48.900 

57.300 

56.600 

40.000 

28.900 

33.900 
193.700 

36.200 

47.700 

31.600 




426 
427 
428 
426 
430 
431 
432 
434 
435 
436 
437 
436 
439 
440 
441 
443 
446 
447 
448 
449 
450 
451 
452 
453 
454 
456 
457 
450 
460 
461 
462 
463 
464 
465 
466 
466 
469 
470 
471 
472 
473 
474 
475 
476 
477 
478 
479 
480 
482 
463 
485 
486 
487 
488 
489 
490 
491 
492 
493 
494 
495 
496 
497 
499 
500 
501 
502 
503 
504 
505 
506 
507 
508 
509 
510 



1 

810 
1565 
1250 
1253 
734 
483 
518 
1020 
1122 
1870 
435 
86 
1740 
599 
743 
801 
1050 
1245 
1576 
1818 
1094 
1945 
1652 
1403 
1394 
■ 905 
1038 
1596 
1528 
1098 
849 
1814 
1388 
1194 
577 
1140 
1797 
1293 
616 
2009 
1205 
1035 
160 
469 
599 
1009 
1216 
816 
683 
1606 
478 
1025 
1045 
1609 
775 
692 
1100 
1760 
882 
470 
494 
980 
1414 
1234 
1246 
824 
1246 
1115 
1189 
1578 
787 
979 
1153 
1730 



704 
843 

303 
847 
562 
1426 
433 

1041 

1170 
196 
673 

1102 
847 
544 

1571 
335 
666 
926 

1298 

1516 

1021 



802 
894 

500 
718 
436 
581 
294 
863 
1137 
1125 
1072 
481 
1084 
467 
688 
524 
1133 
655 
299 
215 
788 
155 
1370 
662 
540 
235 
346 
673 
1013 
599 
607 
1186 
301 
1289 
178 
964 
776 
247 
1256 
1436 
852 
546 
1072 
659 
792 
1134 
1407 
391 
402 
250 
552 
619 
1006 



7.t> 
16.0 
-3.9 
-6.0 
-8.1 
-18 1 
•285 
26.9 
•11.6 
-98 
-0.5 
•31.0 
<-35.0 
-1.8 
•22.8 
-17.8 
-16.2 
■11.1 
-8.2 
-37 
-0.9 
-10.3 
>0.0 
-2.8 
-6.1 
•6.3 
-14.0 
-11.3 
34 
-4.3 
-10.2 
-15.2 
-0.9 
-6.3 
•8.9 
•23.9 
-9.6 
-1.1 
-7.6 
-21 .9 
>0.0 
•67 
-114 
<-35.0 
•28.9 
-22.8 
-11.8 
-8.6 
-15.9 
•19.3 
-3.3 
-28.6 
•11.5 
-11.2 
^3.3 
-17.0 
19.3 
•10.2 
-1.6 
-14.5 
•26.9 
-28 1 
-125 
-6.0 
-83 
•8.2 
-157 
•8.2 
-9.9 
4.9 
-3.7 
-16.6 
•12.5 
-94 
-20 



*1.80c 

2a sex 

2«3CC 

<5.0Ct 
26.?tt 
36.6CC 

10.8X 
80. tQ- 
45.2QC 
33.30C 
19B0C 
12.6K 
29.8X 
63.1X 

3ft ax 

56 OX 

42.800 
63.SK 
50.500 
91 .4X 
35.9X 
254X 

25 ax 

27.800 
58.700 
27300 
60.100 
34.900 
54.800 
25.500 
46.000 
89.900 

131.300 
39 20C 

207.60C 
17400 
45.600 
53.500 

117.400 
77.800 
44.000 

30.000 

40.300 

48.800 

23.700 

89.200 

20.100 
169.300 

31.800 

39.700 
110.700 

21.200 

1SJB0 

36.400 

5X1® 

27.80D 

4SJ0D 

39.O0C 

2&5DC 

16J0O 

88l» 

68,000 

ica<*» 

52.6* 
46.100 
3OJ00 
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511 


AN 


AUA 


512 


1099 


S33 


513 


1696 


1034 


514 


048 


636 


515 


481 


543 


516 


1334 


1044 


517 


868 


1021 


518 


796 


779 


519 


822 


670 


SO 


632 


165 


571 


1332 


830 


522 


603 


1104 


523 


1190 


309 


524 


479 


1226 


525 


768 


1066 


526 


747 


1016 


527 


1170 


231 


52B 


1502 


542 


530 


1728 


620 


532 


507 


1011 


533 


870 


469 


534 


1347 


1065 


535 


1513 


346 


536 


308 


654 


538 


1851 


669 


538 


1463 


962 


540 


909 


561 


541 


625 


289 


542 


1164 


196 


543 


803 


655 


544 


1259 


1143 


545 


856 


1526 


546 


803 


1071 


547 


1162 


274 


548 


128 


1321 


540 


1355 


1122 


550 


595 


866 


552 


1369 


494 


553 


992 


405 


555 


1125 


410 


566 


705 


975 



557 1477 
358 960 
559 700 



562 
564 
'565 
566 



570 
571 
573 
574 
575 
576 
577 
578 



581 
562 
564 
565 
586 
567 



1030 
583 
1109 



»1 
592 
593 
304 



1028 


621 


898 


794 


789 


1446 


777 


766 


960 


328 


1519 


611 


1212 


661 


760 


594 


618 


956 


1142 


771 


532 


787 


771 


250 


1068 


534 


822 


734 


914 


754 


1064 


794 


1S24 


714 


1392 


783 


982 


666 


1467 


672 


756 


731 


667 


1152 


930 


523 


1888 


774 


642 


465 


1317 


519 


65 


1548 


1014 


614 


732 


176 


1627 


478 


1009 


1426 



-16.0 
•10.2 
-25 
-13.2 
•28.5 
-7.1 
-14.8 
-16.3 
-15.7 
-21.5 
-7.1 
-22.6 
-69 
-28.6 
-175 
-17.7 
-95 
-4.6 
-2.0 
-27.4 
-14.7 
-6.9 
-4.5 
<-35.0 
-0.7 
•5.1 
-13.9 
-21.7 
-9.2 
-16.2 
-8.0 
-15.0 
-165 
-9.3 
<-35.0 
-6.6 
•23.0 
-6.6 
-125 
•9.6 
•16.9 
•4.9 
•12.5 
-19.1 
-11.5 
-14.1 
•16.6 
-16.9 
•12.5 
•4 4 
-8.6 
-17.4 
-21 .9 
-9.6 
-265 
-17.1 
-10.8 
-157 
-13.8 
-10.8 
-44 
-6.3 
-12.4 
-4.8 
-17.4 
-19.5 
-13.5 
-0.4 
-21.1 
-7.3 
c-35.0 
-11.7 
-18.1 
•3.0 
-11.6 



58.400 
54.100 
29500 
47.100 
53.400 
28.800 
29.700 
39.600 
45.100 
169.000 
37.300 
26.600 
86.800 
22.300 
28.000 
29.600 
119.600 
53.400 
46.000 
30.000 
57.900 
27.300 
77.800 
46.000 
44.100 
31.100 
52.000 
93.100 
146500 
45.900 
25.200 
12500 
27.800 
98.400 
19.000 
25.900 
35.800 
57.500 
67.600 
66.900 
31.400 
29.300 
50.400 
26.400 
48.000 
38.900 
14.900 
40500 
81.900 
48.600 
45.600 
49.700 
32.100 
40.000 
39.300 
109500 
54.100 
41.800 
40.800 
38.900 
42.800 
39.400 
44500 
45.000 
41.900 
24.900 
56.000 
39.900 
58.300 
55.300 
11.500 
46.400 
17Z3O0 
59.000 
15.500 



USN 



Y CPKo* S0SMW 



507 
596 
509 

600 
601 
602 
603 
604 
60S 



619 
1176 



461 



1465 1044 
741 1168 



907 
667 



4C2 
658 



712 1138 
898 181 



783 
736 

606 629 

607 1064 

608 863 

609 2012 

610 12S5 
612 1103 



613 
614 



778 
•824 



615 1095 

616 1759 



617 
618 
619 



994 
751 
1429 



620 1050 

6?1 923 

622 1462 

623 759 

624 756 

625 1438 

626 1096 

627 942 



1461 
223 
273 
286 
503 
610 
903 
391 
265 
518 
195 
478 
372 
374 
516 
520 

1 1AC 
622 
225 

1038 
606 

1069 
546 



62S 


809 


621 


629 


899 


979 


630 


1135 


1321 


631 


979 


615 


632 


1542 


1076 


633 


1345 


814 


634 


409 


950 


635 


1165 


704 


636 


774 


604 


637 


1263 


524 


638 


952 


411 


639 


1717 


575 


640 


994 


292 


641 


165 


1224 


642 


803 


251 


643 


719 


296 


644 


1100 


294 


645 


534 


1263 


646 


1153 


1038 


646 


1246 


204 


649 


14 


1406 


650 


1713 


1049 


651 


1986 


1183 


652 


1378 


616 


653 


1442 


1165 


654 


650 


806 


655 


1111 


551 


656 


1095 


861 


657 


1524 


540 


656 


1777 


860 


659 


391 


584 


660 


977 


565 


661 


656 


166 


662 


732 


312 


663 


1787 


567 


664 


886 


268 


665 


889 


775 


666 


715 


221 


667 


781 


227 


666 


646 


165 


669 


1116 


353 


670 


1382 


643 


671 


547 


789 


673 


964 


746 



-21.9 
-9.1 
-5.0 
-17.9 
-14.0 
-19.5 
-18.7 
•14.1 
•16.7 
-18.0 
-21.6 
•10.8 
-14.5 
>0.0 
-8.1 
-10.1 
-16.9 
-15.7 
•10.3 
-1.6 
-12.1 
•17.6 
•5.7 
-11.1 
-13.7 
-5.1 
•17 4 
-17.4 
-5.5 
-10.2 
-13.3 
-16.0 
-14.1 
-9.6 
-12.5 
-4.1 
. -6.9 
-32.2 
-9.2 
•17.0 
■8.0 
•13.1 
-2.1 
-12.1 
<-35.0 
•16.2 
-18.5 
-10.2 
-26.1 
-94 
-8.2 
<-35.0 
-2.1 
>0 0 
-6.5 
-5.5 
-20.8 
-10.0 
-10.3 
-4 4 
-1 .4 
-33.4 
-12.5 
-20.5 
-16.1 
-1.2 
•14.4 
-14.3 
-18.6 
-16.8 
-21 .0 
-9.9 
-64 
-25.3 
-124 



100.500 
60.700 
28.800 
23.600 
66.000 
45.800 
25.400 
165500 
14.400 
125.300 
96.700 
94.000 
56.700 
48.700 
34.200 
69.600 
102.000 
55.400 
149.100 
59.000 
72.900 
72400 
55.300 
55.200 
26.600 
47.900 
124,000 
29.000 
48.900 
27.200 
53.000 
46.000 
31.300 
19.100 
48.300 
27.600 
38.000 
32.400 
43.300 
49.000 
54.800 
66.700 
51,000 
92.000 
22.400 
106.900 
90.700 
91.400 
21,000 
29.000 
140.000 
16.200 
28.600 
23.800 
38.000 
24.400 
38.400 
52.700 
36.000 
53.600 
36.000 
50.400 
51.700 
187.500 
86.100 
51.500 
100,900 
39.800 
126.300 
122,400 
189.100 
76.300 
46.600 
39.200 
41.200 



MSN 



Y CPKtf SOSMW 



674 


1661 


448 


675 


1523 


562 


676 


708 


JU*> 


677 


010 


©13 


678 




331 


679 


wu 


so 


680 


19*17 


• CXM 


681 


1 1 fr\ 
1 IUJ 


283 


662 




AT7 


663 


1596 




664 


555 


ADO 


685 


1167 


1313 


686 


1932 


790 


667 


1545 


619 


688 


1456 


764 


689 


1011 


953 


690 


1995 


270 


691 


812 




692 


1154 


1461 


693 


1993 


619 


694 


1628 


656 


695 


928 


254 


696 


1854 


715 


697 


1997 


345 


696 


957 


563 


699 


1540^ 


730 


702 


577 


900 


703 


1610 


562 


705 


1278 


571 


706 


1841 


704 



707 
709 
710 
712 



1018 1386 

1074 1145 

293 889 

720 



713 1386 

714 1328 



715 
716 



696 
701 



717 1875 

718 575 

719 1216 

721 1069 

722 1272 

723 956 



724 

725 



763 
720 



726 1476 

727 1846 

728 510 

729 1217 

730 1856 

731 665 

733 1321 

734 719 

735 1101 

736 1359 



738 
739 



696 
687 



740 1205 



741 
742 
743 
744 
745 
746 
746 



995 
896 
681 
1951 
726 
999 
182 



412 
841 

263 
433 
481 
699 
702 
204 
464 
506 
822 
395 
916 
415 
473 
783 
1126 
724 
765 
312 
427 
473 
569 
220 
409 
256 
563 
596 
161 
686 
166 
643 



-2.7 
-4.4 
-18.8 
-13.7 
-10.5 
-22.7 
-83 
-10.1 
-6.1 
•34 
•24.6 
•9.2 
0.0 
-4.1 
-55 
-11.8 
>0.0 
-16.0 
-94 
>0.0 
-3.0 
-13.6 
-06 
>0.0 
-13.0 
-4.2 
-23.8 
•3.2 
-7.8 
-0.7 
•11.7 
•10.7 
<-35.0 
-18.5 
-6.4 
-7.1 
-19.1 
-19.0 
-0.5 
-23.9 
-8.6 
-10.8 
-7.9 
-13.0 
•17.3 
-18.5 
-4.9 
-0.7 
-27.3 
-8.6 
-0.6 
-205 
-75 
-18 5 
-10.2 
-67 
-19.2 
-19.5 
-8.7 
-12.1 
-14 1 
•14.5 
>00 
-18.3 
-12.0 



749 2005 

750 1446 



1503 <-35.0 
649 >00 



751 
752 
754 



792 
469 
664 



755 1195 

756 1821 



757 
760 



909 
790 



575 
266 
296 
254 
164 
1113 
246 
133 



-54 

-16.5 
-28.9 
-20.3 
-8.8 
-0.9 
-13.9 
-16.5 



62100 
51.900 
46.700 
48.300 
52.700 
33 400 
30.300 
95.100 
59.100 
109.800 
43.500 
19.300 
39.100 
46.100 
40.300 
32.300 
100,200 
34.900 
14.400 
37.800 
45.900 
107,000 
42.700 
7R. rnn 
. 51.800 
42.000 
34.400 
51.900 
51.200 
43.300 
16.900 
25.100 
34.800 
66.600 
36.800 
103.100 
63.900 
56.700 
43,600 
43.400 
140.400 
60.400 
56.400 
37.700 
69.100 
33.700 
66.200 
59.400 
39.400 
25.800 
42.300 
40.300 
85.900 
64.600 
59.500 
51.400 
127.600 
67,000 
106.200 
51.900 
49.500 
165.900 
44.200 
163.600 
46.600 
13.000 
46,300 
51.000 
101.900 
90.600 
107.000 
161,000 
26.300 
111.000 
264 900 



926 



ttmL 



MSN 



Y CPKDl S06UW 



MSN 



V CPKDt SOSMW 



761 
763 
764 
765 
766 
767 
766 
768 
770 
771 
773 
775 
776 
777 
778 
779 
780 
764 
785 
786 
787 
790 
791 
792 
/vo 
794 
796 
797 
798 
799 
800 
801 
802 
803 
804 
805 
806 
807 
808 
809 
810 
811 
812 
813 
814 
815 
816 
817 
818 
819 
620 
621 
822 
823 
824 
825 
826 
827 
828 
830 
831 
832 
833 
834 
837 
838 
839 
840 
841 
842 
843 



1399 
1416 
2020 
651 
1052 
1968 
1130 
197D 
857 
1337 
1576 
9G9 
1438 
1539 
850 
700 
1052 
1413 
1364 
1822 
893 
616 
451 
777 



733 
1085 
569 
475 
1149 
468 
685 
613 
617 
974 
502 
824 
708 
458 
434 
411 
1136 
529 

eas 

835 
392 



1461 
388 

1126 
933 

1420 

1759 
624 



845 
846 

847 



1775 
573 
203 
980 
902 
625 
1851 
440 
1358 
851 
745 
2028 
1086 
629 
1376 
1771 
1045 
984 
1712 
1256 
1517 
1442 
1240 
1309 
2012 
937 
1342 
562 
1073 
481 
501 
751 
635 
1494 
1952 
1585 
571 
1325 
1727 
630 
2016 
673 



1429 
377 
1543 
807 
546 
212 
437 
593 
279 
865 
547 
1468 
196 
494 
1039 
308 
827 
1015 
573 
249 
393 
1246 
810 
645 
313 
1177 
790 
263 
362 
279 
205 
654 
449 
513 
1014 
708 
1405 
756 
626 
1039 
820 
581 
748 
633 
459 
301 
1080 
1312 
649 
301 
679 
905 
1200 



4.2 
-5 9 
>0.0 
-20.8 
-11.1 
>0.0 
-7.1 
>0.0 
•15.0 
•7.0 
-3.7 
-12.8 
-S.5 
-4.2 
-15.1 
-19.1 
•11.1 
•6.0 
-6.7 
-0.9 
-14.3 
-22.0 
•29.6 
-16.9 
-4.2 
-5.1 
-33.6 
-9.8 
-13.5 
-5.9 
-1.6 
-21.7 
-14.2 
-1.4 
-24.0 
<-35.0 
-12.5 
-14.1 
-21.7 
-0.7 
•30.9 
■6.8 
-15.1 
-17.8 
>0.0 
-104 
-21 .6 
4.5 
-1.4 
-11.2 
•124 
-2.2 
-8.1 
•4 4 

-5.5 

-8.3 

-74 

>0.0 
-134 

-7.0 
-24.5 
-10.7 
•28.5 
-27.8 
-17.6 
-21.3 

-4.7 
>0.0 

-3.6 
-24.1 

-7.2 

-2.0 
-21.5 
>0.0 
•19.9 



41.800 
27.300 
51.400 
59.300 
25.000 
59.900 
44.300 
46.500 
48.200 
31300 
56.700 
37.600 
43.100 
61.000 
63.800 
66.800 
25.500 
54.400 
35.000 
37.100 
69.500 
35.100 
15.400 
72.000 
11.700 
38.300 
53.100 
133.700 
63.400 
49.800 
96.500 
35.800 
53.000 
14.200 
148.400 
57.400 
29.000 
87.200 
37.500 
29.900 
51.100 
109.700 
69.400 
21.600 
38.200 
46.500 
85.700 
24.000 
39.100 
103.100 
74.600 
96.700 
139.200 
46.000 
62.000 
55.800 
29.900 
43.100 
16.200 
40.700 
37.500 
.29.000 
37.800 
50.500 
41.100 
37.200 
60.900 
89.300 
27.500 
19.400 
46.300 
69.200 
44.600 
34.200 
23,200 



849 
850 
851 
852 
855 
856 
657 
858 
859 
860 
861 
862 
864 
865 
866 
868 
869 
870 
871 

872 

873 

874 

875 

876 

877 

878 

679 

880 

681 

863 

884 

865 

886 

887 

888 



891 
892 
894 
895 
696 
897 



900 

901 

903 

904 

905 

907 

908 

910 

911 

913 

914 

916 

917 

919 

920 

921 

923 

924 

925 

926 

927 

928 

929 

931 

932 

933 

934 

936 

937 



1863 
1166 
1535 
1035 
834 
499 
1063 
887 
1448 
706 
1070 
472 
674 
1307 
645 
827 
685 
1807 
1323 
1228 
1904 
556 
1540 
1566 
1196 
1076 
1161 
647 
1756 
1543 
1432 
922 
1103 
1501 
798 
636 
951 
717 
1123 
691 
1245 
1962 
' 1322 
420 
662 
845 
624 
931 
799 
765 
775 
888 
828 
661 
1544 
1606 
1237 
1442 
1260 
764 
1133 
1123 
829 
1131 
1441 
679 
1487 
1082 
1231 
1609 
810 
965 
947 
865 
1421 



271 
523 
1024 
626 
542 
220 
194 
690 
639 
311 
1066 
347 
460 
499 
887 
1004 
494 
402 
783 
1031 
346 
647 
756 
777 

351 
720 

1111 
757 
594 
278 
890 
689 
414 
607 

1103 
634 
759 
548 

229 

413 

234 

346 

626 

570 

428 

243 

703 
1094 

229 

520 



824 
1303 
1544 
301 
387 
688 
749 
367 
1541 
1123 
380 
242 
318 
874 
219 
1191 
775 
816 
670 
900 
520 
462 
843 
1056 



-0.6 
-9.2 
-4.2 
-11.4 
-15.5 
-27.8 
•10.9 
-14.4 
•54 
•18.9 
•10.7 
•28.6 
•19.9 
-7.4 
•21.0 
•15.6 
•19.5 
•1.0 
-7.2 
-8.4 
-0.3 
-24.8 
-4.2 
?i * 

-8.8 
-10.6 
-9.3 
•20.9 
-1.6 
-4 1 
-5.7 
-13.7 
-10.1 
-4.6 
•16.3 
•21.3 
-13.1 
-18.6 
-9.8 
-14.3 
-8.2 
>0.0 
-7.2 
-31.4 
-20.3 
-15.3 
-21.7 
-13.5 
-16.3 
-17.2 
-17.0 
-14 4 
•15.6 
-19.7 
-4.1 
-3.3 
4.3 
-5.5 
-8.0 
-17.3 
-9.7 

•g.e 

-15.6 

-9.7 

-55 
-19.7 

-4.8 
•10.5 

•8.4 

•3.3 
-16.0 
•12.8 
■13.2 
■14.8 

-59 



99.500 
54.900 
29.600 
37.500 
63.400 
127.100 
150.500 
34.800 
46.900 
86.200 
28.000 
77.600 
56.800* 
57.000 
34.900 
30.300 
57.400 
66.000 
39.400 
29.300 
77.700 
46.400 
40.700 
33,700 
76.800 > 
42.500 
26.400 
40.700 
49.700 
97.100 
34.600 
44.100 
66.400 
48.900 
26.600 
47.200 
40.600 
52.900 
121.200 
66.400 
117.800 
77.700 
47.700 
51.300 
64.500 
113.000 
43,400 
27.000 
121.000 
55.200 
34.800 
37.600 
19.700 
11.700 
89,100 
70.400 
44.100 
41.100 
73.700 
11.700 
25.900 
71.500 
113.200 
84.300 
35.400 
128.200 
23.500 
39.800 
38.000 
45.100 
34.400 
55.100 
60,600 
36.800 
28.400 




939 
941 
942 
943 
944 
945 
946 
947 
948 
949 
950 
951 
952 
954 
955 
957 
959 
960 
961 
962 
963 
964 
965 



967 
966 
969 
970 
971 
972 
974 
975 
976 
977 
978 
979 
960 
981 
963 
964 
965 
987 
986 
990 
991 
992 
993 
994 
995 
996 
997 
996 
999 
1000 
1001 
1002 
1003 
1006 
1007 
1009 
1010 
1011 
1012 
1013 
1014 
1015 
1016 
1017 
1018 
1020 
1021 
1022 
1023 
1024 
1025 



1187 
1765 
602 
312 
993 
1300 
630 
187 
1380 
1766 
1038 
860 
957 
503 
1938 
1010 
768 
596 
557 
887 
564 
969 
671 
1204 
910 
609 
1265 
822 
976 
403 
279 

644 

1124 
994 

1612 
749 

1064 

1197 

1762 

1344 

1024 
739 
816 
785 

1159 

1090 

1030 

847 

902 

668 

1815 
1205 

617 

968 

970 
1736 

643 

822 

875 

291 
1386 

459 

679 
1818 
1032 
1629 
1311 
1722 
1015 
1574 

781 
1129 
612 
785 
1290 



827 
865 
472 
498 
491 
2G9 
423 
736 
344 
665 
193 
152 
701 
547 
712 
816 
174 
419 
409 
320 
334 
1156 
255 
796 
154 
1048 
206 
232 
437 
567 
495 
961 
295 
664 
642 
1141 
642 
911 
1506 
317 
1105 
1159 
555 
361 
317 
928 
701 
811 
461 
847 
579 
504 
289 
290 
771 
478 
1184 
487 
279 

644 . 

745 
541 
661 
1128 
634 
994 
1134 
424 
743 
1219 
484 
83 
317 
446 
739 



4.8 

•1.5 
-22.7 
<:35.0 
-12.1 
-7.5 
-21.6 
<-35.0 
45 
-1.5 
-11.3 
•14.9 
-13.0 
-27.6 
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•17.2 
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-24.5 
•12.8 
-20.0 
4.7 
-13.9 
-22.3 
-7.7 
-15.8 
-12.6 
•326 
<-35.0 
-15.3 
-9.8 
-12.1 
•3.2 
•17.7 
•10.8 
4.8 
-1.6 
4.9 
-11.5 
-17.9 
-15.9 
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-9.3 
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-11.5 
-15.2 
-14.1 
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•0.9 
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-22.0 
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-6.4 
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-0.9 
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•3.0 
-7.4 
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-3.7 
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-9.7 
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■16.7 
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91.100 
45.400 
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38.200 
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56.500 
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23.700 

58.100 
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46.600 

41.200 
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25.800 

47.200 
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41.300 
22.500 
58.400 

591.3°° 
84.800 
62.400 
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856 
1284 
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1547 
1381 
1525 
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1761 

541 
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1036 
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• v • ■* 
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393 
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836 
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1697 
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•15.0 
-7.7 
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68.300 
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36.600 
34.000 
56.300 
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47.100 
66.700 
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37.800 
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48.000 
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45.500 
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46,600 
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39.000 
33.000 
41.800 
45.800 
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36.000 
51.600 
58.500 
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38.900 
34.000 
49.500 
34.600 
53.700 
59.100 
33.000 
116.000 
26.600 
45.200 
38.800 
54,200 
46.300 
53,100 
42.400 
28.000 
48.000 
40,400 
38.000 
39.300 
33.100 
27.600 
48.300 
19,700 
44.700 
61.700 
36.200 
38.600 
34,700 
37.500 
51.400 
23,800 
42.300 
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1154 

1161 

1162 

1163 

1168 

1170 

1171 
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1177 

1178 
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1194 

1195 

1196 
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1199 
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1201 
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1203 
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1205 
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1212 
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1215 

1216 
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1504 
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564 
552 
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1304 
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1407 
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614 
603 
696 
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475 
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1411 
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794 
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663 
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514 
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586 
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5.76 
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4.71 

4.66 

4.61 

4.56 
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4.48 

4.44 

4.40 

4.36 

4.32 

4.29 

4.25 

4.22 
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Table 4. Computed pfi of some tao«n proteins reUud 10 meisured CPK p/s 



Protein Name 

0 Creatine phospno kinase (CPK). rabbit muscle 

1 Fatty acid-binding protein, rat hepatic 

2 b2*microglobubn. human 

3 Carbamoyt-phosDhate synthase, rat 

4 Prealbumin ( serum albumin precursor), rat 

5 Serum albumin, rac 

6 Superoxid disminase (Cu-2n. SOD), rat 

7 PhosphoUpase C. phophoinosmoe-specific (?), rat 
6 Albumin, human 

9 Apo A-l lipoprotein, rat 

" 0 pro Apo A-l lipoprotein, human 

* 1 NADPH cytochrome P-450 reductase, rat 

* 2 Retinol binding protein, human 
' 3 Actin beta, rat 

u Actin gamma, ra: 

1 5 Apo A-l lipoprotein, human 

1 6 Apo A-IV lipoprotein, human 

17 Tubulin alpha, rai 

18 FlATPase beta, bovine 

1 9 Tubulin beta, pig 

20 Protein disulphide isomerase (PDI). rat hepatic 

21 Cytochrome b5. rat 

22 Ado C-ll liooorotein. human 

Amino aaC pi assumeo m calulation: 
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6.B4 
7.83 
6.09 
5.97 
5.96 
5.71 
5.91 
5.92 
5.70 
5.32 
5.35 
5.07 
5.04 
5.06 
5.07 
5.10 
4.66 
4.66 
4.80 
4.49 
4.07 
4.59 
4.44 
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An updated two-dimensional gel database of rat liver 
proteins useful in gene regulation and drag effect 
studies 

mw h Sf «M ? ^ Ved UP ° n lhC refCrCnCe tw °- di ™™onal O-D) electrophoretic 
map of rat live proteins originally published in 1991 (N. L Anderson et al 

MmSS^v 1 ? 1 ' 901 ' mi A ° f53 Pr0lCinS 002 spols) « 
n^e iu^l b * ™™? uen «ng. In most cases, spots cut from wet, Coo- 

TnH nn "* nc < 2 " D * eIs wcre submitted to internal tryptic digestion [21 

SpJS 1 PepUdcS ' " Paraled by W*h-Perfonh.ncc liquid chromatographv 

Ss wer? L S ^r UC H nCCd USmg 2 r Perkin ' Elmcr sequenator. Additional 
spots were identified using specific antibodies. 



Figure 1 shows the current annotated 2-D map of F344 
rat liver, analyzed using the Iso-DALT svstem (20 X 25 
cm gels) and BDH 4-8 carrier ampholytes. Both the 
map itself and the master soot numher cvcta^ 

, • * villain 

tfte same as shown in the original publication. Table 1 
lists the important features of each identification shown 
including the gel position, p/. and A/, for the most' 
abundant or most basic form of each protein. Using this 
extended base of identified spots, a series of four 
improved calibration functions has been derived for the 
p/and SDS-M, axes (the first two of which are shown in 
Fig. 2A and B). Both forward and reverse functions are 
derived, so that one can compute the physical properties 
of a spot with a given gel location, or inversely compute 
the gel position expected for a protein having given 
physical properties: 

>RATLIVER = /^RATL!V£R , (Af f SE0LEVCE . 0ERIVCD ) (]) 
^RATLIVER = /p»-RATllV E R X (P'sEOL-ENCE-DERrVEo) (2) 
GEL-DERIVED = /raT LIVER Y-M r ( *RaTLIV E r) (3) 
P'oEL-DERfVED = ZraTUVEA X-pl (X%aTLJVZ$) (4) 

A spreadsheet program (in Microsoft Excel) was devel- 
oped to facilitate flexible computation of pfs from 
ammo acid sequence data, and the results were entered" 
into a relational database (Microsoft Access). A table of 
spot positions and sequence-derived pi's and M y s was 
fitted with a large series of analytic equations' using 
Tablecurve (Jandel Scientific), and the four conversion 
tqs. (1M4), relating computed p/and gel X coordinate 
or computed molecular weight and gel r coordinate' 
were selected, based on criteria of simplicity, goodness 
or in and favorable asymptotic behavior. Table 2 lists the 
equations and coefficients. Application of Eqs. (3) and 
(4) to a spot's X and Y coordinates, given in [1], produce 
improved M, estimates, and allow computation of p/ 

. COm £™ d w" : Df ' Uigh ^t™*' ^rge Scale Biology Corpora- 
iim^ «« Ccmtf Drive - Ro * v "'«. MD 20850-3338 USA (Tel: 
+30M24-5989; Fix: +301-762-1892; email: leigh lsbc.com) 

/ Ke M, 0n! / : 7r°- d ; mensional POlyacrylamide gel electrophoresis / Liver 
/ Map / Identification / Calibration 

© VCH VcriagjgcenKbift mbH. 6945! Weinheim, 1995 



directly m pH units, instead of in terms of positions rela- 
tive to creatine phosphokinase (CPK) charge standards 
The inverse Eqs. (1) and (2) were used to compute the 
gel positions of a series y ot p/ and M r tick marks. These 
tick marks were plotted with SigmaPlot (Jandel), 
together with fiducial marks locating several prominent 
spots, and the resulting graphic was aligned over the syn- 
thetic gel image (computed by Kepler from the master 
gel pattern) using Freelance (Lotus Development). Maps 
were printed as Postscript output from Freelance, either 
in black and white (as shown here) or in color, where 
label color indicates subcellular location (available from 
the first author upon request). We have also used the rat 
liver 2-D pattern as presented here to calibrate the pat- 
terns of other samples. Using mixtures of rat liver and 
mouse liver samples, for example, we made composite 
2-D patterns that allow use of the rat pattern to standar- 
dize both axes of the mouse pattern. This was accompli- 
shed by deriving transformations relating the fat and 
mouse X, and separately the rat and mouse Y y axes 
(Table 2, lower half; Fig. 2C and D) based on a series of 
spots that coelectrophorese in these closely related spe- 
cies. These functions were then applied to derive equa- 
tions relating the mouse liver X and Yto p/and SDS-A/ f 
(Eqs. 5 and 6 below). The resulting standardized 2-D pat- 
tern for B6C3F1 mouse liver is shown in Fig. 3; 



OUSE LIVER — /rat LIVER Y— Mr OmOUSE LIVER Y-RaT LIVER Y 

(^MOUSE LIVEr)) (5) 

= /r> 



P/. 



MOUSE UVER — 7RATLIVER X-pl (/MOUSE LIVER X-RaTLIVER X 

C^mouse liver)) (6) 

A slightly more complex approach can be used to stand- 
ardize samples that have few or no spots, co-electropho- 
resing with rat liver proteins. In this case, a 2-D gel is 
prepared with a mixture of the two samples, and four 
functions (forward and backward, each for X and Y) are 
derived relating each sample's own master pattern to the 
composite. The required functions are then applied in a 
nested fashion to yield the desired result (using rat 
plasma as an example): 

^r»ATfLASMA = /raT UVER Y- Mf (/raT PLASMA ♦ UVER Y-RATUVER Y 

(/RAT PLASMA Y*RAT PLASMA ♦UVER Y ( ^RaT PLASM a))) 
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/Vgiffe /. Master 2*D gel pattern of Fischer 344 rat liver proteins, annotated with 53 protein identifications and computed p/ and M T axes. 
Tentative identifications are in italic type. 



Table 1. Proteins identified in the 2-D pattern of F344 rat liver 



MSN' } 


Protein ID b ' 


Protein name 


Identification comments 


Geir 1 


Experimental 
p/" 


Gel r' 


Experimental 
A/," 


126 


HADO-HUMAN"* 


3-HA-3.4-DO: 3-hydroxy- 
anthraoila te -3 ,4-d ioxy- 
genase 


Internal sequence 


871.95 


5.36 


92135 


30 207 


137, 159, 288, 


D1DH.RAT 


3HDD: 3-hydroxysteroid 


Ab (T.M. Penning) and pure protein 


1857.52 


6.51 


822.52 


34 406 


258 




dihydrodiol reductase 












173 


MUP.RAT 


a : u globulin 


Presence in liver microsome lumen, 
abundance in kidney, p/, M t 


919.16 


5.43 


1313.81 


19 549 


38 


ACTB.HU MAN 


Aciin 0 


Analogy with other mammalian patterns 
(e.g. human) through coelectro phoresis 


763.40 


5.19 


693.64 


41 586 


68 


ACTG.HUMAN 


Actin y 


Analogy with other mammalian patterns 
(e.g. human) through coelectrpphoresis 


779.42 


5J1 


692.26 


41 677 


693 


AfAR.RAT 


Anatoxin Bl aldehyde 
reducusc 


Internal sequence 


1993.32 


6.72 


818.60 


34 593 


28, 21. 33 


ALBU.RAT 


Albumin 


Coelectrophoresis with principal plasma 
protein 


1262.81 


5.86 


445.64 


66 354 


43 


dham.rat 


Aldehyde dehydrogenase 


A'-Terminal sequence and AAA 


1317.72 


5.91 


589.03 


49 602 


96 


ARG1.RAT 


Arginase 


Internal sequence 


1730.72 


6J4 


756.02 


37 819 


117 


SUAR.RAT 


Arylsulfotransferase 


Internal sequence 


1547.96 


6.14 


849.08 


33 186 


1163. 1161. 


CR78.RAT 


BIP (GRP-78) 


Ab (F. Witzmann) 


665.33 


5.01 


397.39 


74 564 


1162,20 
















185 


CAH3.RAT 


CA-III 


Uncertain: by comparison with mouse 


1996.60 


6.72 


1017.02 


26 887 


123 


CALM .HUMAN 


Calmodulin 


Analogy with human cellular patterns 
through coelectrophoresis 


23.05 


4.03 


1433.25 


17 419 


3, 201, 48, 39, CRTCJUT 


Calreticulin 


Ab (Lance Pohl) 


310J9 


4J4 


433.80 


68 206 


22, 24 
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Table 1. continued 



3-D Database of rat liver proietns 1979 



MSN 1 ' 



Protein IDb) Protein nunc 



Identification comments 



Gel X*' Experimenul Gel r' Experimental 



1184, 1186. CPSM.RAT 
114. 174, 118 
5, 167, 157 



54.61 
136 

87 

41 
29 

5. 11 

60 

27 

17 

196 

79 

62. 78 
125 

307 

413, 1250, 
933 



8. 23. 1307 
15, 25, 110 
971 

1216, 1215.! 
256 

415. 734 

80 

227 

134 

18, 35, 226 

175, 251 
1168. 1170. 
1171 
47, 93 

236 
320 

152 

1179, 1180, 
1181. 1182, 
1183 
55. 103 

135 

172 

277. 56 
50, 1225 
1224 



CATA.RAT 
COX2.RAT 

CYB5.RAT 

CK-RAT" 

CK-RAT*' 

ENPL-RXT 0 

ENOA.RAT 

ER60.RAT 

ATPB.RAT 

ATP7.RAT 

F16P.RAT 

DHE3.RAT 
HAST- RAT* 



Carbarn y] phosphate 
synthase 

COX-II 

Cytochrome B5 

Cytokeratin 
Cytokeratin 
Endoplasmic 
Enolase A 
ER-60 

Fl ATPase 0 
Fl ATPase 6 

Fructose- 1 .6-bis-pbosphatase 

Glutamate dehydrogenase 
HAST-1: N-bydroxyaryl- 
amine sulfb transferase 



2-D of pure protein; comfirrned by 1453.56 6.05 
Miermirul sequence and AAA 

Internal sequence 2000.81 6.73 

Ab (J. W. Taanman), confirmed by 45237 4.61 

internal sequence 

2-D of pure protein; Ab; confirmed 515.68 4.73 

by AAA 

Location in cyioskeletaj fraction 1165.12 5.75 

Location in cyioskeletaj fraction 743.11 5.15 

Ab (F. Wiumann) 567.73 4.83 

Internal sequence and AAA 1399.78 6.00 

^-Terminal sequence (R. M. Van Frank) 1184 JO 5.77 

^-Terminal sequence and AAA 629.06 4.95 

Internal sequence 1227J4 5.82 

Uncertain; by comparison with ID in 924.54 5.44 
Garrison and Wiger (JBC 257:13135-13143) 

^-Terminal sequence and internal sequence 188739 6.55 

Internal sequence 1297.94 5.89 



181.64 160 640 



499.64 
1062.67 



58 968 
25 504 



1370.55 18 493 



569.09 
605.23 
263 .37 
62334 
523.51 
588.83 
1184.65 
737.77 

566.92 
86135 



51 448 

48 187 
112 194 
46 674 
56.169 

49 620 
22 310 
38 858 

51 655 
32 638 



HOl.RXT 


Heme oxygenase 1 


Uncertain; available data from interna] 


121939 


5.81 


915.7! 


30 423 


HMCS.RAT 




sequence 


V 








HMG CoA synthase. 


Ab (J. Germershausen) 


1033.48 


539 


538.13 


54 571 




cyiosolie 












HMCS.RAT 


HMG CoA synthase. 


Ab (J. Germershausen), ^-terminal 


666.40 


5.02 


1019.42 


26 811 


HS7C.RAT 


mitochondrial (frag) 


sequence (Steiner/Lottspeich) 










HSC-70 


Positional homology- (with human, etc) 


811.87 


5.27 


425.76 


69 521 


P60.RAT 


HSP-60 


through coelectropboresis 










Ab (F. Witzman); confirmed by AMerminal 


845.09 


532 


520.03 


56 561 






sequence and AAA 










HS70-RAT 1 


HSP-70 


Ab (F. Witzman) 


976.11 


531 


43734 


67 674 


1 HS90-RAT' 


HSP-90 


Ab (F. Witzman) 


659.86 


5.00 


329 


90 107 


1NGI-HUMAN 


Interferon-y induced 


internal sequence 


993.85 


534 


1006.04 


27 237 


LAMB-RAT 0 


protein 












Tjmin g 


Positional homology with human through 


737.10 


5.14 


425.19 


69 615 


LAMR-RAT*' 




coelectropboresis, nuclear location 










*Laminin receptor* 


Internal sequence 


534.02 


4.77 


697.62 


41 327 


FABL.RAT 


L-FABP (liver fatty acid 


Ab (N. M. Bass) 


1586.09 


6.18 


1483.43 


16 £22 




binding protein) 










MDHC.MOUS 
E 


Malate dehydrogenase 


Internal sequence 


1270.85 


5.86 


861.96 


32 620 


GR75-RAT' 


Mitcon3; grp75 


Positional homology with human through 


905.67 


5.41 


413.67 


71 589 


NCPR.RAI 




coelectropboresis 










NADPH P450 reductase 


2-D of pure protein 


824.69 


5.29 


39331 


75 366 


PDI.RAT 


PDI: Protein disulfide 


^Terminal sequence (R. M. van Frank), Ab 


56430 


4.83 


528.47 


55 618 




isomerase 












ALBU.RAT 


Pro-Albumin 


Microsomal lumen location, p/, AY, relative 


1391.03 


5.99 


446.68 


66 195 






to albumin 










APAI.RAT 


Pro-APO A-I lipoprotein 


Coelectropboresis with plasma protein 


920.41 


5.43 


1137.51 


23 467 


IPKl.BOVIN 


Protein kinase C inhibitor 1 


Internal sequence; homology with bovine 


1480.01 


6.08 


1458.81 


17 007 






protein 










PNPH.MOUSE Purine nucleoside 


Internal sequence 


1507.19 


630 


911.16 


30 599 


PYVC-RAT 0 


pbospborylase 












Pyruvate carboxylase 


Tentative; 2-D of pure protein (J. G. 


1485.10 


6.08 


22332 


131 589 






Henslee, JBC t 1979); reported in Biochim. 










SM30.RAT 




Biophys. Acta 1022, 115-125- 










SMP-30: Senescence 


lnternaJ sequence 


721.71 


5.11 


830.10 


34 051 


SODC.RAT 


marker protein-30 












Superoxide d ism u use 


AAA; comfirmed by internal sequence 


1161.24 


5.74 


1388.68 


18 173 


TPM-RAT*' 




(R. M. Van Frank) 










Tin: tropomyosin 


Location in cytoskeleton, 2-D position 


476.24 


4.66 


957.86 


28 865 


TBA1.RAT 




relative to human, Ab 










Tubulin a 


Positional homology, with human through 


688.22 


5.06 


537.67 


54 620 


TBB1JUT 




coelectropboresis, cyioskeletaj location 










Tubulin 0 


Positional homology with human through 


621.29 


4.93 


. 535.48 


54 855 


VIMEJUT 




coelectrophoresis, cytoskeleul location 










Vimeotin 


Posiional homology with human through 


673.00 


5.03 


53930 


54 426 






coetectrophoresis, cyioskeletaj location 
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Table 1. continued 



MSN* 1 


Proidn IDb) 


Protein name 


Identification comments 


Gel JT 


Experimental 


Gel r } 


Experimental 
•Af." 


113 


Unknown 


?: not is sequence 


Internal sequence 


1191J8 


5.78 


680.42 


42 469 






databases 












104 


BBPL.RAT 


23 tDi morphine-binding 


Internal sequence 


773.31 


5.20 


1182.41 


22 363 






protein 













b) SwissPROT identifier 

c) Coordinates of the most basic or most abundant assigned spot on the F344 master ge! pattern 

d) pi and M, of the most basic or most abundant assigned spot, derived from the calibration functions included here 

e) SwissPROT sryle proposed identifier 
Abbreviations: AAA. amino acid analysis; Ab, antibody 



Table 2. Equations and coefficients 



Function 



Equation (0 



r2 



Rat gel Y - ftcomputsd .l/.i v - a Aexpf-x/e) 0.988181021 178.74803 1967.7892 32363.958 

Rat ge! X « flcompmed ph y « a -r bx - ex/lnx - dlx + e/x 1 - 3 0.99247216 -8685665.5 -$04497.94 3856926.1 

Computed M t « ftrai gel Y) v « 0 -r bxc 0.9960177 -8464.5809 19095881 -0.9086255 

Computed p/ « flrai gel X) ym a + bx + ex 3 + rfx 3 lax + ex 3 0.99176499 4.044686 -0.001 J4238 0.0000323 

Mouse gel X « ftnt gel JO >• « o + ix •*■ cx 3 * + rfx" lax + 

a/lax 0.99951069 11861.44 678.91666 -0.789649U 

Mouse gel X - Aral gel X) y-c + bJlnx+cx^ + dx 3 0.99926349 58.935923 0.00091353 ^0.000213688 

Rat gel Y - ftroouse gel Y) y «= a + bx 3 lax + cr 13 + dx 3 0.99950032 69.740526 0.00050772 -0.000130392 

Rat gel X - fimouse gel AT) > - « * Ax + ex 3 lax + rfx" + ex 3 0.9992832 -198.07189 2.0899063 -0.000671191 



18276844 -27154534 



-0.00000455 0.O0OO00O01 76 



1567.S639 
0.00000159 

0.00000 n 6 

0.000145189 



-6953.9592 



-0.000000986 



y^+bx+cx/lnx+d/x+e/x^l .5) 




B 



y=a+bexp(-x/c) 



> 




50000 10O0O0 
computed MW 



150000 



y^+bx+cx A 2lnx+dx A (2.5)+ex A 3 



D 



ysa+bx^lnx+cx^.SJ+dx^ 
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CO 
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figure 2. Plots showing fits of selected equations (continuous curves) to data on identified proteins (square symbols). (A) p/ computed from 
sequence data »«w gel X position for identified spots in F344 rat liver, (B) M r computed from sequence data versus gel Y position for identified 
spots in F344 rat liver; (C) gel X position for spots in B6C3F1 mouse liver versus X position in F3443 rat liver, for coelectrophoresing spots; (D) 
gel Y position for spots in B6C3F1 mouse liver versus Y position in F3443 rat liver, for coelectrophoresing spots. In each case, inverse equations 
were also computed (Table 2). 
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Figiw J. Master 2-D gel pattern for B6C3F1 mouse liver, standardized using the rat liver pattern identifications, according to tlje method 
described in the text. Twenty-nine proteins are identified. 



PAaT PLASMA — AaTUVE* X-pl (/raT PLASMA ♦ LIVER X-RAT LIVER X 

(/RAT PLASMA X-RAT PLASMA ♦ LIVER X (^RAT PLASMa))) 

(8) 

This unified approach, in which one well-populated 2-D 
pattern is used to standardize a family of other patterns, 
has the additional advantage that the resulting pi and M t 
scales are directly compatible. Hence one can compare 
the relative pT% of mouse and rat versions of a se- 
quenced protein in a consistent pi measurement system, 
and select likely inter-species analogs based on posi- 
tional relationships on common scales. Adoption of 
immobilized pH gradient (IPG) technology [4-7] will 
result in . substantial improvements in pi positional 
reproducibility for standard 2-D maps such as those pre- 
sented here; however, we believe that our approach will 
continue to be useful in establishing the empirical pH 
gradient actually achieved by such gels under given 
experimental conditions (temperature, urea concentra- 
tion, eta), in relating patterns run on different IPG 
ranges and using different lots of IPG gels (between 
which some variation will persist). Development of 
rodent organ maps is a continuing effort in our laborato- 
ries 18-10], and results in regular additions of identified 
proteins. Those who wish to receive current rodent liver 
maps, with color annotations, should send a stamped 
self-addressed envelope to the first author. 



We would like to thank the individuals who provided anti- 
bodies mentioned in Table 1, and R. M. van Frank for i/n- 
published sequenced data. 
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identify all cDNA .spec.es. and the approach doe., do! easily a],*, a 
screening. Analysts of gene expression by ,hc s.udy of protein* present ,n ■ ce7n 
t.ssue present, a favorable alternate. This can be thieved bv use of tu o^ im .„^ 
<-D> gel electrophoresis, quantitative computer , mage anih s.s. and protetn Idem* 
canon techniques to create 'reference maps' of all detectable protems. Such r-ferencr 
maps establish patterns of normal and abnormal gene expression in the or C an,\m and 
allou the examination of some poswranslational protein modification^ , ^ 
functionally important for many prorens. h is possible ,o screen proteins 
cally from reference maps to establish their identities. * 

To define protein-based gene expression analvsis the concent nf ,k- 
«as recently proposed (Wilkin,* ,995: Wn^^^ °{ i^om-TTh 
enure PROTe.n complement expressed by a genOME. or bv a cell or tissue tN nV Vh, 
concept of the proteome has some differences from that of the genome, a, whither 
« only one definit.ve genome of an organism, the proteome is an entitv u h ch can 
change under difterent conditions, and can be dissimilar in different tissues of a' in.u 
organtsm. A proteome nevertheless remains a direct product of a senome Inte^ 
tngly. the number of proteins in a proteome can exceed the number of 8 e„ e s D ™T/ 
as protein products expressed by alternative gene splicing or with dVferent Z " 
translauonal modifications are observed as separate molecules on a ^. D ^ T 
extrapolatton of the concept of the 'genome project', a proteome project" l esearch 
vvhtch seeks to .dentify and charactense the protetns present in a cell or ti^ue and 
define their patterns of expression. nu 
Proteome projects present challenges of a similar magnitude to that of *enome 
projects. Technically the 2-D gel electrophoresis must be reproducible and^of h£h 
resolution, allowing the separation and detection of the thousands of protein, in a cdl 
Lou- copy number proteins should be detectable. There should be computer *el ima "c 
analysis systems that can qualitatively and quantitatively cataloc the electrophoreticallv 
separated proteins, to form reference maps. A range of rapid 'and reliable .echnSS 
must be available for the identification and characterisation of proteins As^Z * 
quence of a proteome project, protein databases must be assembled that contain 
reference information about proteins: such databases must be linked to «cnom,c 
databases and pro.e.n reference maps. Databases should be widelv accessible and eTsv 
to use " " ,,u c « N > 



foMh/nn^.' r m . 3ny Chan?CS " ,hC ,cchn, <^ ^ resources available 

tor the analysis of proteomes. I, ,s the aim of this chapter to discuss the statu, of the 
^outlined above, and to review briefly the progress of some current proteome 

Two-dimensional electrophoresis of proteomes 

Two dimensional CD, gel electrophoresis involves the separation of protein, hv their 
uodecme potm ,„ the firs, dimension, then separation accord, n| to molecular w £h 
j < ulfa,e «l««>phoresis in the second dimens.on. Since Is 

descr.bed (Klose. 1975; O'FarreU. .975: Scheele. 1975,. ,, has become the me tod o 

tion's !n ,H ^T": * COmP '" n,iX,UreSOf P™<™<">*« »«h manv modi^a 
t.ons to the ong.nal techntques. 2-D electrophoresis fonrn the basis of proteome 
projects through separatmg proteins by their size and charge (H K h tra se e,T 
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vital io allou comparison of gel. from day to dav and he»«™ re. -ar-h si,. T , 
factor, can be diff.culi to achieve. ' M,e> Th ** 

Orner ampholyte, are a common mean, of isoelectric focusm*- for ,h- m . 
d.mens,on of 2-D electrophoresis. GeN are u.ually focuse J to equilibrium to <4 ate 
protem, in the pi ranee i to 8. and run in a non-equilibnum m„de iVEPHGEmo 
<*pa.ate proteins of higher pi (7 10 1 l.5i tO'Farrell J97* (VFanHi r , 
CTFarrel,. .977, Unionunately. the u.e of ^r^^t^^ 
forcing procedure ,s susceptible to cathode drift*, where* pH «rrad,em* estahh Z , 
b> r efocus.ng of ampholytes s,ow,y change w, th IIfne (Rlche P tll anc^ " 
Carr.er ampholyte pH gradient, are also distorted by h, 2 h sa,t concentrate of 
sample. .B.ellqv.s,,/,,/.. J9S2). and by high protein load lOTamM 19^, Afunhr 
limitation . < that i.o electric focusing gels, which are cast and subject to electronhor, 
^ in narrow glass tubes, need to be extruded by mechanical mean* before armlLion 
to the second dimension - a procedure that potentiaJlv distort, the gel Nevenhcle 
many of the above shortcoming, can be avoided by loadinc .mall amounts of "C or 
radiolabelled samples (Garrels. 1989: Neidhardt et aL 1989: Vandekerl-hnv. ., 
.™. ™„ scns.uvny detection is then achieved throuch use\>f fluorc-ranhv or 
phosphorimaging plates (Bonner and Uskey. 1974: Johnston. Pickett and Barker 
1990: Patterson and Latter. 1993, However, this approach is on.v pr^c^ 
organism- or tissues that can be radiolabelled. 

An alternative technique, which is becoming the method of choice for the firs, 
d.mens.on separation of proteins, involve, isoelectric focusinc in immobilized nH 
gradient , IPG , ge« , BJe.lqv.st „ 1982: Gorg. Po.te, and Gumher.^t r " e ^ 
1990,. Immobilized pH gradient, are formed by the covalen. couplin* of the nH 
gradient into an acrylamide matrix, creating a gradient that i. completely stable with 
..me. IPG gels are usually poured onto a stiff backing film, which T'nie hanic K 
strong and provide, easy gel handling (Ostergren. Eriksson and Biellqvist. 1 988 The 
major advantage, of IPG separation, are that they do no. .uffer'from cathodic drift • 
the> allow focusing of ba.ic and very acdic proteins , 0 equilibrium. pH cradicnts c -.n' 
he precisely tailored .linear, stepwise, sigmoidah. and tha, .enarationCovcr a v C n 
narrow pH range arc possible (0.05 pH units per cmnRiahctii I WO- Bielluv.s, „ i 
.982. ,993a: Sinha ,,,/.., ,90: Gorg 9 * 8: Gcl ^ 

I m, Houevcr. .. no. currently possible to use IPG «N ,o separate xerx h .siJ 
protein* of ..oeleeinc pom. greater than 10. although th.s vmikr development 
Narrou p H range separations are useful ,o address problems of pro.cn co-nuLion 
m complex samples, allowing 'zooming in" on regions of a «l ,*W ^ p£ Z 
strips are now commerc.ally available, which begin to addres'. the problems „ f in 
and inter-lab isoelectric focusing reproducibility. 
There are two mean, of electrophoresis for "the second dimension separatum of 

1988,. Both are usuall> SDS-containing gradient gcK of approximates 1 1* "/ 
u.rylam.de which separate prote.ns in the molecular mass rancc of 10 - | M)kD A 
packing gel is no. usually used with slab gels, but is necessary when us.nc horizontal 

Uu fcor no c ^ GUm , her - ^ C ° m ~ h - shown mat there 
role or no difference in the reproducibility of electrophoresis usint: Cll her approach 
Corbe„ ,,„/.. ,994a,. but commercially available vertical or horizontal pre™ 

u .11 prov.de greater reproducibility for occasional users. For slab eel elcciropho eT 




Fipure2. T». -dimensional pel electrophoresis sllowv **,„,,„.. ,n • „„ , 

: rrcneint e.nnmon 10 each eel (A . Wide p| rawe im. d.™«T?. -ti , M R, " ?v h, ? h,, ? hl 

P"ne,n> F.rs, d.mension ,en 3 r 3 „o„ uMnrTn Z ? TIT^" m ^ " f •«««• P"-« 

Tnc second dimens.on SDS-PAGE tcZ eel s 2 ™ T PH • rjJ,Cn ' ° f ^ ^ "' ° un "> 

ru.n» nu P The f.rst d.mens.on used a narrou r3ncc -mrnoh" , J pH en/ ?f ;"''' n " f '* 
^;*nnd dimension ua< SDS-P^GE Mirrnnrrf^.. i i ^-wicm m m 5.. umtv and 

p«- ™, . ^ S's.'srfsS! "* rci h "" ,cj •• rvDF 

caahst been shown 10 glve be.icr resolution and hither wmitiviiv 

'HochMmsser and Merril. INI: Hod— P„chom ik and McrriL 
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Notwithstanding the advance described ab-ve. there is jn increase „ , 
improve the reproducibihty of 2-D electrophoresis to facil.ate toST^? " 
and pro.com. studies. Harrington « a/. (1993, explain t„ ai if 
proton spots, and there „ 99.5-, spot rnatchtns from eel to gel. thA u TpZ ceT 
spot error, per gel. Th.s amount of error. w„,ch might accumulate w„ h each ^ "."el 
companson used ,n database construction, could produce ,n unacceptable J- e / 0 
uncena.my in gel databases. To address these issues. panid automation of Lt ~ n 
gel separations ha, been undertaken (Nokihara. Moritaand KurikU 99° I , " ' 

.... lw ,, r „ no , £d ^srfe 



MICR0PREPARA7IVE :-D GEL ELECTROPHORESIS 

VVith the advent of affordable protein microcharacterisation techniques, including N 
terminal m.crosequencing. amino acid analysis, peptide mass fin-ernrintin It 5 
analys.sandmonosacchand^ 

pnorsMs has been to maintain high resolution and reproducibii.Iv but^o p ' 1" 

protem.nsufnc.emouantitiesforchemicalanalvsis.highnanoenmtolow 
quantities of proteins per spot). This becomes difficult ,o achieve with 
samples such as whole bacteria, ce„, ,s the initial protein ,oa^U^damo^S 
to 4000 protein species. Two approaches are used for producing amount n 1* - t 

and'ooo, th ChSmiC f CharaC,CrSCd - firSl 

und pool the spots of .merest, and subject them to concentration ( j, „ «/ ,W WaS 
also act a, a punficat.on step to remove accumulated electrophorctic contaminan 

of IPG isoelectric forming. The high loading capacity of immobilised nH %r . 
was described early ,Ek. Bjellqvist and Ri^hetti I9«i hT h-? i pH ? ? d,cms 

mg of protein can been applied ,o a s.ngle gel. yielding Z^^Z^lL 
dreds o» protein species, a funher benefit of this approach i/that oro.Z » ° 
ow abundance^ch may no, be v.sual.ed by iZr protein ™ ^ eh" 
to be detected. The use of electrophoretic or chromatographic pre fractional ~k 
niques (Hochstrasser c,aL 1991a: Hamngton „ «/.. I W2, foHou^ b v^hT 
f-™e,PG separate ^ 

«uclies on proteins present in low abundance. ' lo 

Methods of protein detection 

^ dt,ec,i "S f ™ :-D (eh. The mchod u ,cd win be 

mclaied hv factors includint proie in load on .el lanalwirul ™ „,.„ ? 
purpose of the ? el ,fo, pro.e,„ /uami.auon or fcrT^S^nSS^- 
■on, and ,he S en Si ,ivi,v required. The mo S , common me ? an< o p ZT»Z Z ^ 
.he,r apphcauons are <how„ ,„ TV,,,,, /. Mo S , de,ec,ion me,hod 'have dra"" t" to 



Progress with pmivnmv /uvi/mt 
Table I: C'~^«n <ia».»< for :-D gel* or hints and ihetr arplu-ammv 



Deiernon 
Metnod 



Main 

application* 



InsunaMe 
application* 



Senium 



Retrrence* 



i^SI Met or -C 
radinlahclling and 
fiuorograpnv or 
phosphonnuging 

I "Slthmursa wiiver 



Siivcr 



CoomatMC hiue 
R-250 



Cell lines. 
.'ultureJ nrtanttrm 



Sampie* that 
cannot re labelled 



Colloidal cold 



Zinc imidazole 



Ponceau S and 
nmidf hi.uk 



it id i.i ink 



Stain-all 



EuremeK high 
ien«ui\n\ pel 
naming 

\ erv lugh *cn*i- 
i vnv eel naming, 
can be memo or 
poh chromatic 
Staining of eels, 
diamine of PVQF 
memnrancs heicire 
protein tcqucncine 



Staining NC 
membrane*, 
(taming PVDF 
he fore direct 
MALDI-TOF 

Reverse staining 
of ecU or mem- 
hranes: may he 
beneficial in 
MALDI-TOF 
of peptides 

Staining higher 
prmein load* on 
PVDF. fur protein 
teuucncing or amino 
acid anaK 

Staining of 
mcmhrane -hound 
proteins, \tainmg 
PVQF heforc direct 
MALDI-TOF 

Stjining 10 detect 
glycoprotein* tir 
Ca " binding 
protein* 



Preparative 2<D. 
PVDF or NC 
membrane* 

Prerarame 2-D. 
PVDF or \C 
membranes 

Staining prior to 
direct ma" deter- 
mination from 
PVDF: amino acid 
analysis on PVDF; 
detection of some 
glycoprotein* 
Gelt 



Where positive 
image i* required 



Staining prior to 
direct ma** 
determination iroi 
PVDF 

Gel naming, not 
quanttiatixe from 
protein td protein 



General gel naming 



:eppmnf GancKanJFrjn/a. 
radiolaret in 

a V** Latium. Carre* and 

Solt-r. Nu« 

li J ng prmein \\ allacc and Saiu /. 
on ipot i.r band l«Vlo.h 
of gel 

J ng protein Rahitli%uj. |uu; 
on sfHv or HochMra^cr .rid 
hand of gel Mem!. IVXS 

«W» ng protein Strupai a a! . |wsi4 ; 
on hand or Gharahdachi rt a!.. 
spot of gel 

Guldhcrg rtnL IWKK; 
Sanchez rt a!.. 



«> * higher Yamaguchi and 
than Asakaua. |yR8 : 

cooma>MC Eclcrskorn a «/.. 
IV92: 

Strupai rt at.. 
Higher than Oni/ /•/ «/.. 
OKimjNMC Jamc*r/«/.. 



n ? Sanchez rt aL. }*W2: 

protein on Strup.ii rt at.. 

hand nr *pot \Vilk, n * r ; at.. | v*>5 
of ycl 

l-lllnc Li rifi/.. 14X9. 

Hugho. Mack and 
Hampanan. I«SK. 
Strupai rt at.. |w«J 

IU»ng protein C'amphcll. 

on hand ,.r M.uLcnnan and 

*pot nl - C l J«'fgcnvcn. IQ.sv 

Goldberg r/ *//.. |vxk 



r..,n, m ;« ue sr ^d..™ . -» « ^ — t 

example some glycoproteins are not stained by coomassie blue (Goldbcrs ct al 
1 988 1 and many orgamc dyes are unsuitable for protein de.ect.on on PVDF .fsamnles 

Although most means of protein detection give some indication of the quantities of 
protein present, in general they cannot be used for global quantitation. This is because 
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no proteii. <tain i< able eminently 10 detect proteins over a u --fe ran o e of _ 

.innv isoelectric p0 tm< and amino acid composttion, and u„h . £^ £ 

ro«.trapUaiionalmodincaiion*iGoldbcrc«fl/..J988:Lifii// 1989, Funh 

there are large difference^ staintne pane,,, when ident.caJ gels' or bio.s are suh™-d 

to differ.™ stains, mclud.ng am.do black. ,mida2ole zinc, .ndia irk non^u / 

colloidal gold, or coomass.e blue .Tovey. Ford and Baldo. 19.7; OntZ ^ p" 

llie rr.o a common means of quantising large numbers of protein, in a " ; n c- 

involve, the radioiabclling of protein samples prior to el-r.r«nh™ , ' - D 

quanrta.ion based on flun^i ? ° cLclro P ho ^'v and prote.n 



quanrta.ion based on fluorography and image analysis or liquid scintilla,™ 

not conuin 

acid ,« of protein spot* ^i^^ViA^^A^'^^^^- Am ' n " 



meth.on.ot cannot be detected if only p S ] methionine is used for laH-ll 
acid analysis of protein spots v.sualised by other techniques prevents a ifkelv means of 
protein quantitation for the future. • Jn pf 



Ait 

BLOTTING OF PROTEINS TO MEMBRANES 

\ re** 

or r 
pui. 



EJe^tropnoretic blott.ng of proteins from two-dimensional polvacn-bmide , cK to 
membranes presents many options forprotein identification and m,crocha™c*rT^on 

wh,ch are not possible when proteins remain in gels. For example, when p n " " ' I* 

bloited to polyvinvlidene difluonde «PVDF, membranes. ,hev can be idcnS b N ^ 
«™.nal<eqi*nc,n^^ 

to endoprote.nase digestion, monosaccharide analysis, phosphate ana.vsi t d c t 

ma nx-asMsted laser desorption ionisation mass spectrometry (Matsud ura 

V Hk,ns, /; ,/.. ,99,: Jungb.ut,,,,/.. ,994: Sutton „„/.. ,995: Rasmussen, ^ j£ 

Ue.zthandler r, „/.. ,993: Murthy and Iqbal. 1991: -Eckerskorn a „/ T^Mt is' T " h 
possible ,o combine of some of these procedures on a single protein sp 0 ( >n a'pVDF 

membrane .Packer „ „/.. , 995: Wilkin* « aL submitted: Weiahandte « «?/ ,99^ — 

Th. ,s useful when rn.nin.al amounts of prote.n are available for anaK M s' TW " ^ 

cchmque* w,„ he explored in detail later in this review. Withstands the love ° El 
there are some disadvantages associated with blotting of prote.ns to bra . " 

The „ ^ ]oss of sumple dunn? Woli . ng ^ ( £c P c n brjnc v q IE. 

1993.- and common prote.n detection methods are less sensitive or no, , P „. • ,ki 
membranes , T „ Mr 1 , presenting difr.cult.es for the anaK ,i 
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Follow ,„, pro , c ,„ elecrophoreM* and deletion, detailed analv,,, „f ,. el „ n ,„ cs js 

Reference eel unuges. c on «rucied from one or more eek. form the h-,!,W Z 
d,n,c„„ona, ? e, da,a h . K . These d Mha ,es „«, ^ prole ,„ ^ ^ ^ 
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digress with pmuu,„, c p m j en% 
d.uaih of their po ,-trans.auon.il modifications M> *l 
linked «o or infrared tt , lh comprehensive pVoi™ and n i ?inninr 10 
(N>idhardi r/fl/.. 19K9: Simpson ««/ ,99' ^ ^-nucleic and double, 
databases, comamins DNa sequence data. chr'nJT , and or ^nj.vm- 

D „U and pro, : ,„ fu , ct , onal 1^^^ -*»« 
£ and p r „,eorn t pro; icl < propeS( ,v«B^„" , *t' W » i 

Database cued in GarreU e: ul.. 1 994 j. 1 W ~ » Cu<t Prote.n 



GEL IMAGE ANALYSIS AND REFERENCE GELS 

After 2-D electrophoresis and protein visuali«afi«« k • 
phosphonmag.ng. ,ma 2 es of 2 eis are disi ^li^d'fo " ' Sta,mn? - nuoro ?^> or 
-anner. laser densnome, or char 2e co^l TJZ^CcZ ^ 
Ceus a!., ,990a: Uru- in and Jackson I*™*- ,9S9; 
resoluiion of J00- 200 mm. and can detect a ^Ic^nlT ^ U " h a 

or more - gny scales ). FoHou- in8 mis. 2e j ^'l^T* °> ***** 

pulation, to remove venica! and nonzontal ^S^^ '° of — 

<pot position, and boundaries, and to calculate snoMn, lutk ? 0ui,d h '«- «o detect 
<P°« 'SSP, number, comaimn* ven.cal a td ho^o A wndard 
assigned to each detected spot and becomeTft^^ infora «"ion. 
««* some notable softwares Xb Jo^V?^~ 

Tahlt s„mr S„f.u Jfc P atrtagt , f „ r , h . Analysis nfGcl l„« ces 
Gel Inure A nal> s,< Svhcih Reference*- 



ELSIE - A: 
CELL AB I <L II 



OKer. 3 nJ M.llcr ,9K 8; u,„ h e , aL 1W|; U (n(i ~ 
Lemon ani | U P , on . Lcmk.n. w„ anJ JI^' £ 

0LES7I* lUnd POOL-EST S^Sv't'- "'f^^ rt ^ 



MELA ME | * |j 



rYCHO* KEPLAR 



-•000 ,o 4000 p ro ,.,„s from ont „,* ^ ™'J>"« mvoK e , ,„ c nu , ch , . of 

manually de«i In «« approx.ma.tfv 50 or vo „Z ' " h> ' an °' >cr; " 0 '- >v "o 
■<•■« c,os S . ma , cnsd . p P r P 01e , ns S!^"'"'""^"- " ^ 

i"in| compuitr-bawd vecior al.oriihm, ^ ^ ' esuWBh «l around landmarks. 

Clo« ,o 10K of spots C r"'' ' hC mWChin - c ov " -d 

al.hou ? hdiff C r t n,d«rLXS 'l™"" by ' h "' mtlh ^ 
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JcnM.omc.cr. ,B , Gci .mace after prc^s ,1 7*' ° n?,na ' m3 * C " c ^u,cO hv laser 

ol oil s P o.s „n ,„ e ce , " P"«"«"f «o remove «rcai.„g awJ background <C, Ou.l.nc dcfin.uon 
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/Wr«.t wttli prou nniv pr„, va% ^ 
CALCVLAT10N OF PROTEIN ISCILHCTRIi* POINT AND M015CVLAR U-£, ClfT 

Estimation of ihe isoelectric pom. «pl: a nd molecular weich. (M\V, 0 f proi-.n. 
•D ? el ; p:ov,d : , fundamental parang for each prof™. ^2^^ 
dunnc identification procedure .sec f jllow.ng section.. The pi and M\V of nro,^ 
are record ,n 2-D gel dataha-es. Accurate e>„ma:ion> 0 f proL p! and £ 
obtained by using 20or more known pr Jt ei„s on a reference map ,o construe, sundart 
a,™ of p and molecular we.ght. which art then used «, calckuc cst.ma L P , a , d 
MU of unknown proteins (Ne.dhard. e, aL 19S9: GarreK and Franza 1089 v V 
Bogelen. Hunon and Ne.dhardi. I99L: Anderson and Anderson 1 99^ An f 
ut - 1 99 I : Latham „ aL 1 99: , Alterr^vely. the MW o^S^^ Z 
.o P\ DF can be determined very accurately b> direct mas, spectronK^ E' k C r ^ r 
« oL ,992, Where irnmobiI „ cd P H gradient, are used, me foc^ p ^ ^ 
proteins a, *., thetr pi to be measured wuhin 0.15 unit, of that ealculirt 
aminoac.d sequence (Bjellqvis.r,*/.. 1 993c i.limuM be noted however tha/n T 
carrying pos„ransI,,onal modifications may mterate to u,!cxp ^Tmw 
positions during electrophoresis (Packer ei aL 1995). P 

V 

SPOT QUANTITATION AND EXPRESSION ANALYSIS 

A major challenge faced in proieomc projects is the quantitative analvxi. of nro,™ 
separated b> Z-D electrophoresis. The mos, accurate means of protetnqua nn2 Z 
to determme chemical.y the amoun, of each protein present hv am^a • d con 
national anah s,s. However, .he current method of choice for quantitative anal" ; 
of many protons ,s lo radiolabel samples with [»S] methionine or "C amino acid 
perform .he 2-D electrophores.s. and measure protein levels in disintc" Ton t 
mmute .dpm. or un„s of optical dens,.y. Quantitation is achieved SKt^Sd 
-nt, iauon coummg. or In gel image ana.ysi, .here spot dens.ue, are quan tat d 
^ W ? Z ' jhbrM,0n N,npV C " ma,nin - C known of rad.olaM d 

« «/.. 1990: Celis ct aL. 1990b; Celis and Olsen. 1994: Carrels 1989 L iil! 
GarreK and Sol.er. .993; Fey , 1994,. AU approach, effect 'vd v ulhnv ^poiv'Jo 
ne normahsed agams, ,he .otal d.Mn.egra.ion. per minutc loaded o„,o 
Lim.tai.on* thai remain with rad.olabelling methods are tha. aKolmc quantitation 
not achieved because all pro.em, have varvinc amoun.s of anv an^nn L ,7 
on,v eas„y .abe.led samples can be invested. Quan^,? ^ ^ 

an ahernauve .Giome,,, c, aL 1991: Hamnc.on c aL I9«:. RcH jri C uc2 cnTmt 
lynck rt „/ .993,. wh.ch when undenaken w„h PSJ.h.ourca , Walla e a d S L " 
^ of extremely high venshivitv. " 
When pro.e.n spots from samples prepared under different conditions are quantised 
,nd matched from gel to gel. i, becomes poss.ble to examine chances and pa ueni n 
Pro.e.n expression. Large scale ,n vesugation of up- and down-re^tio^ ot p o" mT 

rans;r a H J r C "I «" hf "^-aken. For examp.e. sjL^% 

transformed human keratmocytes were shown to have 1 77 un-rculated ^nd « v 
regu ated proteins compared to normal keratinocv.es (Celis ITo^lSj, ^ 

""^ ,99L ,99 ^^nd 4 proteins out of 1971 were found to be markers'for 
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cadmium Icr. icily in uniun protein* fMyrielin,,/. iooj, OwJ., . w , fc 
in p.-o, ; ,n e^™ „ . ,„„i, of jene d,<n.p„o» SIU ve *. bJi^^t. 
-tf P Mo.. -L^l Phonal communication,. !mpre«ivelv. b~ e „, ^£ 
prcem expression under different condition, can be clob.ll> ",n« ,, " e d u" 
^ „ ahnd. , ha , fi „d group, of re.ated object., uihin a «,. For « 1 7 
KfcF^2 mi ell line daiatme. confine of 79 rHc f™« i -> . " jn, P ,te - inff 

cx.mnc. v conduced o, Z E < * Pm,e '°- ,hM - fa 

nmenuul for inv C ,,,„,,on of cellular control mxteZn?** 1 S0>n " M " ta ' 
curr : n„y techm.-ally , rapoKiblc usi „ s Jctae^^SS^ ** " 




Hunun hrari dainr.a\c* 



Human kcrjnnn;-\ ir da:aha*c 



\U*u*; vmr»r\i. daianasc 



M<»u*l- Jivcr djiahave 
* Af^nnnc Protein 
Mapping Group » 

R.'i i'\rr epithelial dni.ih.ivc 
Rj: Itv cr datable 



REF 52 rji cell line daiahasc 



-^V|5S :DPAGE cufiMininc 
Human rricrerwc map* 



^ c:im Prntem Donate * VPDi 
and VejM Elctrirnphoreue 
Pr i uein Database (VEPDi 



Gci vp« M » iinkcd with GenBank 
and kohara clones; quantitative 
spot measurement* under differ- 
cm jrrouth condition* 

Identification 01 disease markers; 
»"<» separate database* have 
been established 

Extensive identifications; 
uuanmative spot measurement* 
of tranviormcd cells: identtfica- 
mm til d»*ca*c marker* 
Quantitative vpot 
measurement* through 
I m « cell mjl*c 
Documents chancev due to 
c\puxurc in mni/inc radiation 
and chemicals 
Detailed subcellular 
iracfionanon *iudie* 
Extensive ciudiev nn regulation 
ni protein- b> druc* and io*,c 
aeents 

Acces*ihlr via World Wide Weh 
uuanutaiixc *poi measurement* 
under difiercm conditions 

Aeee**iblc via W ( trld Wide W C h. 
complete!* intccrated u ith 
SWISS-PROT and 
SWISS-3DIMACE 

CompleteK cro\srclereneed 
oreanism database. VPD has 
extensive inlormation on over 
VVK) proteins; VEPD has 
manv identification.* 



Baker era!., |yv: 
Cornell rial.. I9wjh 
Jun^Mui rial.. IVWJ 

Cell* cj al.. (wvtJa 
Cell* c / al.. |wy3 
Celt* and OUen ivyj 

Latham n at.. | wy | 
Latham r/ al.. 

Cionu-tn. Ta>| ( «f ;„,d T..||..ksen. iw^ 



^in:,r//;/.. (uyj W lft h,7 (/ /. ,g U ; 

Ande?Min and AnderMm. I«w|. 
Anjcrsnn r/ it/., iwy^; 
RwlurdMiM. Horn and Ander*.Hi. |ugj 
Can-eU and Pran/a lyxy 
Bouteli r/ a/., lywa 

Appei r//x/.. iyy^ 
Hi>cn-tra**cr r; «/.. |uy2 
Huche* rf |yy ; 
Cfla/ rr al.. lyy* 

CarreU r/«A. |yyj 



in;\*r:: .. 

:-d 

of reic 
^houlJ 
Macim 
the ar^ 

ann(n:r 

One 
SWISS 
1904; 
feature 
2DPAi 




Inlitrm:- 



Anmua 



Rcfw 
Datah. 



Other 



FEATL-R£S OF PROTEOME DATABASES 



Proteome projects rely heanlv on computer database to store ,nforma,, on about all 
protons expressed by an orpntsm. 'Pro.come databases" should contain dl , ' 
information of protein> already characterised elsewhere, a. udl a< protein data from 
:-D gels such as apparent pi and MW. „ pressi0n levcI undcr J^J^ 
subcellular location. anC information on post-translaiional raodifiaiionx l mw ' 

u T B ? ' SSCW ' m? Pr0,Cin SSP numbers and P roi " n idemirica^' 
should also be included. Ideally, proteome databases should be acccs ,N C 

Macintosh or IBM persona computer, and easy ,o use. Some proteome dataha e. and 
.he areawhev cover are l.ied in TaMe Databases ranee from collccoV of 
annotated g eU to large databases of images integrated with protein and nudeu acid 
sequence banks. t,u 

One example of an integrated proteome database is the suite of SWISS PROT 
SVVISS-2DPAGE and 5WISS-3DIMAGE databases , Appel uL 199 V { DD 1 °, 

features of these three databases are listed in Tabic V. SWISS-PROT <u»cc 
2DPAGE and SWISSODIMAGE are accessible through the World Wii'w* 

Table 4: The SWJSS-PROT. SWISS-IDPAGE and SWKc w\i A rc 

AW ,nr J; ^n,„ 3 , ^H* mn.ueh ,hc w2 uS^XXS^ 

e*pn*} hcuet.ch/ » 



SWISS-PROT 



SWISSODPACE 



SWISS* JDIM AGE 



Inionruimn T 



Text entries of sequence data: 
Citation information; 
taxnnomic data. 3b\ J03 
entries in Release 2V 



Annotation* 



Protein function. 
Po<r irnnstononal 
modifications. 
Domains. 

Sccondan structure. 
Quaternary structure. 
Diseases associated 
w tth protein. 
Sequence conflict* 

SWISS2DPACE 
SWISS- JDIMAGE 
EMBL. PIR. PDB. 
OMIM. PROSITE. 
Medline. Flyhase; 
GCRDh. MatzcDB. 
WonnPcp. DicnDB 
Other Features Navigation to other 

SWISS daiahate* achieved 
h\ selcrtinc rnmes u uh 
computer mouse 



Croc*. 
Refcrrnced 

Dnsanas.es 



2-D pel imaccs of: human 
liver, plasma. HepG2. HepC2 
secreted protein*, red hUid cell, 
lymphoma, cerebrospinal fluid, 
macrophage like cell line, 
cmhrolcukemia cell, platelet 

Gel imaces uherc 

protein i* lound. 

Hou protein identified. 

Protein pi and MW, 

protein number: 

normal and pathological 

variant* 



SWISS-PROT and all 
other databases 
accessible throuch 
SWISS-PROT " 



Gel imaccs shou position 
of identified proteins, or 
region of pel uherc protein 
should appear 



Collect ion of 
imasre^ ol proteins 



Ml anmitairon is 
.oiiil.ihk- m SWISS. 
PROT 



SWISS-PROT and all 
• 'ihcr databases 
accessible thrnuch 
SWISS-PROT 



Mom« and stereo 
■maces available. 
I mates can be 
translcrrcd 10 local 
computer imace 
viewing procrams 



%2 
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(Bemerj-Lee « <i/.. 1 992 1. allowing an \ com* 

« . pmm. whidn, ow, acid < eque ™ at^L™ *"** " lfo ' TO "">" 
canons, can h* obuined. in; p re c,« prottm " ?7 ^'-'^"onal modifi- 

R^ncc. ,o nuCcic J and « * ~" if 

••■cress to information stored elsewhere. aa ' Jf **e* are j.so men to prov.d- 

Orcanism" databases, containine rim-Mi.** _ 

» specie,, are teccmun^^^" ^ BHC, « ** '"^rmat.n 

PROT because they are ,ma 2 e to«r^^^t KCji,,keC,CnBaBkorSU "'SS. 
map posinons. iran.scnpt.on* of senes wd ^l ^ ^ chromo ^ntal 
c W/«„ c,,/, sene.protein database iCsS u*™™ ^ *• 

VanBoseien and Neidhardt. 1991 ^ VanBoo!^ T Neidhardt - >990: 
EC02DBASE. » one example. 1. contaTn!^".," aL kn — - the 

information (including pi and MW estimate^'and 'T^r ^ 2 ' D * eI *P« 
matton (GenBank orEMBL code, chromosomal ^ ,de " t,n " l,onl - *encik- infor. 
• Kohara. Akiyama. and Isono mTZ^ lo ^ ^uon on Kohara clones 
regulatory information (level of pTotein eSS 7''?" ° f * Cncs,: and P™™ 
memher of re ?u ,on or stimulon^ Te.^^ S^KS^ 
referenced to the SWISS-PROT database (Ba.rocn and B^'l ^ ^ 
anticipated thai orsanism databases will soon h^TI B ° eck ™nn. 1994,. i, is 
available .nformauon about a pan* K ^r nd 1 mCanS ° fMnr,n ^ 11 
consistent manner tn »h,ch orsamsm ^Vl CUWM,y "° 

comparisons , n the future. ' assembled, vvh.ch ma> hamper 

Identificatiun and characterisation of proteins from 2-D o tl$ 
The number of prote.ns identified on a 2-D reference man 

a research and reference tool. As most referent I ? dC,erm,nev l,s "^fulness as 
proteins identified, a maior aim 0^1^ ^ « ™" P"P— of 
from 2-D maps. In order ,«, define th-n a TnZ P ^ *° * rcsn ,nan - v P™** 
databases, or as unknov , n -. ^Sr^'^^^^crn 
open rcad.no f rjmss . and providff ^ M ^ - confimn,„o n of DNA 

charac,erisat,on efforts hv P poi nlI „, tf ^ pro ,c ^^''"f prt ^ iccu anJ P ro ^>" 

.VKKU4000 proteins from a s^te%l^ h 7 ^ Sinee ,h?r; ^ ^ 

pntein screeninc ,s ,o idem ^ B ^ ^ ^ ^'i-,c in 

Traditionally, protein, from I Z * k V m ' n,nU,m ° f COM and c «™ 
."■mu„ob,ot„n g P s^JZ mic^p^ ' nte^ * ^ ^ 

comtpaion of unknoun protein, ,,h ^ 2 hTo SeqUCncinp - 
homolo2ousn cn - sofin , • . " P ro, cms. or h> overexpression of 



Table ?: HierarchiraJ anal> m.< for mass scrremnc of * Pi 
Rapid and me rrr^ivc te; ;muue < arc u*cd a; a r,«\ ^Wied rmiew* h| IIIICd 



Order Jue nufiraiion icirnmou: 
Ammo a;*io ana >sn 

Am.nn ac.d 3 'ul..<is w» n Vternnnal sequence ia; 
Pepude-nia« in eerpnn.inc 



Combination ot amino acid anaU$i< and J? 
ma** fingrrpriniine 

Ma« spr.-tromrtr> sequence uie 

Extensive N-tcrmmal Edman rruLviKequencm; 

Jmernai peptide Edman m;rr<Kequcncinj: 

MnrrtAtoueni-inf h> m:« *ptwtromem leteriro. 
*nra> ionization, nosi-souri* dera> MALDI-TOF) 
Ladder <cuuen.mp 



Reference* 

JuneMuir;*/;.. |gu : . Sluu , uu , 

HoKihm.Houirue^canJ.SanjVr jucj 
iunpWu. ««/..|.n«. W| , lm . , 7 „: ,^ 

*ilkm* n«/.. .^uhmiiicd 

Mann. Hcrur and R^ rMltf|I JutU 

-Nuiiiin r/«/.. |yy5 

Cnrdu-ell rr a/.-. 1 905. 
^"asmeer auL 

Mann and Wilm. Jvvj 

Niaisudaira. IVK7 

Rosenfeld rrn/.. ivv: ; 
Helifnar. n aL i vy.s 

Johnson and W jNh. ivv^ 



alur nJ i I vc,o t r a d,t 1 onal a pproach ? s,r^/c.5:W a sin 2er „ a/ !9 o„ . . 
use of rap,d and cheap identification tool, such as Im,„n . , Th,s,nvol ^sthe 
mass fineerpnnt.ng as fi rsl Meps in pr^^^^^T *« 
Mou er. more expensive and time con,u m in 8 id~^^ ,hc ™ of 

the construction of this hierarchy the anaivsTs P i ,f nccc ^ 

of .he data created has been -ca^^J^^^^^^Y 
maenme t.me P er sample, the analvsis of data can „ ,iU,C 
con,um,n S . Am.no acid anal vsivandp -ptidema' £ ■ qU,IC ,nVO,Vcd ;,nd 
•«hniq Ue , in ,he h.erarchv are di<cu£5 n d ^ ^ M "T^ 2 ^ ^ficauon 
-ntification techn^ ,n s^^^, ^ protcm 

W07SIN IDENTIFICATION BY AMINO ACID COMPOSITION 

^fi««ion e ;? P ^ for 

Th„ technique u.es a protetn\ idiosyncratic arnmn Et ^r,kom « «/. . I98K|. 

.0 ident.fv ,t hy comp ^ mon ^iZ^^ 1 ^^ ^ ltl0n profi|c m or£jcr 

The am,no acd composn.on of h^T * ° f pm,eins " da,a ^s. 

«/.. 1994: Frev «<,/.. 1994, or bv acid hvZ , ^Phoresis (G ;,rrcls £ v 
chromatographic ana.v.i, ^^^t^o^^^"^ 
l988:Tou S « fl /.. 1989: Gh«h^ , I aC,d kl m * X,UrC '^erskom w „,.. 
'Wi. As differential nj^^^l^^ 1 ^ ™^^eu,L 
Phor-tmaee plate exposures of up ^^v^^^Z" ^ ^ ° r 
--lled^thetechn^ets^^ 
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Spa: zzztz-siy. 

■utt»:::n:c 



Asx 12.2 31x: Ser: 5.7 j ^ 

52y: i < Thr: j.6 AU: 6.7 Pro: :.'f 

?>t: :.3 >-S: 5.0 VaI: E.O Met: 0.3 

S.9 t,«a: 6.0 Ph«: 13. J ty«: <^ 

p: «--:a»:e: €.59 Ran?* searched: « €.«<, , 14 , 
Mw tt::u:r: ICBCC Rtn 9 « surged: (I3««o. 20140! 

Cis.est SWISS-??" cr.:r:« isr =ht tpi:»i ZZC_Z m— he- in • - 

Rank Scer« ?":e:a pi Mv BMsripsion 

1 2 ^:-;::r HI J!!:? 2? Bam » l activator 

5 43 JCV._r 8.5B 19769 HSKOLYSIH C. PUa"« 

"" \ 

Clsses: SWISS-PRC? er.zriei 'or ICCil with r2 ^ , . 

2 ..2 -^=-- : 6.73 17921 TRAJ PROTSXK. 

3 .2 : 6.79 19028 HYPOTHETICAL LIPOPROTEIN YAJ- 

I IS. SKS I:S 1SS 5SSS SA2 Srssrs 

ISS.PROT i..r £ The ;,.mx, ucnufuauon. aspartate carhamm hraiKiennc. s ,„u „ ,„ LT 
,«d,, a ,c j ,,,„j mat , h N,.,c h..« muuhm? *..h,«adcl.ncd r l and M\V , ansc ,,, m , ^ " 1 , ^ 
ha. ? rc;ul> ,», rea *d .hr sere d.l.crcn.c hemccr, «hc f,rt, and<c k „«d t^n^ rC 

■ * the lttivi idi-niifi.-aimn ( W.IL.ns r; „/.. 1 995, ' 1 '" C " T ranL, "r' Tfifin 

craph> -based anal> s,s. Pro.e.ns blotted 10 PV'DF membrane, can be hvdrohscd in I h 
l:.vC . am.no acid* extracted ,n a s, ng le brief Mep. and e;,ch sample automatically 

toTn ,urT d K h> ' Chro,TW,0 ? ra P h > in «nder 40 minutes <\Vil kins c, ai 

0u " ' ™> '• ^ th.s manner, one operator can routinely anah se 1 00 proteins 
per week on one HPLC unit. This technology lends itself to'automat.on. and i, is 
anticipated that instruments with even greater sample throushput will be developed 

T r^r'p H n VC P , rCparCd b} ' m,cro P re P a ™»< :-D electrophoresis (Hanash 
« «/-• I WI: B,ellqv,M , Wbl . bloUed l0 a pvDF memhranc H an£j ^ u ( h 

™ ° b '" Ck - »>• ^«Wj Pnnein spot ,, of sufficient quantity lor am.no acd analysis 
'Corduellr/«/.. I99i: Wasinper ««/.. 1995: Wilkins ««/.. I99<i 

After the ammo acid composition of a protein has been determined, computer 
programs are used ,0 match i, apain« the calculated compositions of proteins in 
i , , oo ( n E C c! rSk ° m " " L 1 988: Sibba,d - Sommcrf «^i and Ar,os. 1 991: Wblu, 

99 Match '"n >»< WUUns « 

199m. Matching usually done with only 15 or 16 amino acid*, as cysteine and 
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PROT: 

SEQU 
Whci 



S?st ECCII-ACJ 
sictscitsttsn 



5.'- Clx: 10.8 Sen 4.1 

3iy: Thr: 3.8 XI*: 11.9 

T:t: €.( Krg: 3.7 V4l: 9.5 

lit: 5.i. l*u: 8.2 Phe: 3.2 



Kit: 2.7 

Pro: 3.2 

Mtt: c.« 

tyi : 4 . 9 



Rank 


Sccrt 


Prcreia 


Pi 


Hw 








(.03 


45311 


2 


22 




5.8$ 


36502 
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38 




5.78 


45774 
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44 




5.86 


48018 




45 


I>HS4.rcCi2 
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4B581 
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46 
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37851 
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47 




5.98 


49162 
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585 


43290 
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50 




€.01 


370M 



p: ti:^ :t: £.99 lunge s*&rch«d: ( 5 7< e 3., 
Mw .i::rr., 45000 K^ g . s.« =hcd: (3<00q ; jJ'JJJ 

Closest JWZSSrPRrT enrri«s icr EC~ - w ^ ~* 

K * 2 K y 

82 * e t y > 

K A 2 E Q 
«KHU 
«iK9 A 
« S 5 X L 
K E S R 2 

Fipure 5. A PVDF protein cpot trnm an £ c,,/, 2-D rcicrcn c mm « . 

<on,e sample then Mineri u.amim. acid anah mv The N-ic^m^Kc ^ ucn " Cil ,l,r J **> and the 
a. id ciitnpiiMimn o! litr spm. well a* estimated nl and M\V ^^J*!**^ M L K R XVh ™ ihc amino 
PROT ,.«£. , „/, ,he *ne I,m of hes, mxd* * W 3,1 ■« SWISS- 

r.f tho, c entrie, The .o P rankmp .dcni.fiai.nn ^nnc Zdu ^ «™«SWISS.|>ROT 
wrc d.Mercn,, hc.wecn lhc fir$l anJ sc^d *n^ ^ » 

the c,.rrc t -« proton ,dcn..f,cai..w. However, the <cuuciut L t M L k IT r w ^ cncc ,n heme 
protein < C r.nc hydmiymciht liranslcrjve * wmlirmcd llu: idcnnu m the 

n» co T ,„ p „ s producc > aSa S^ZS^^C^^T 

r/«/„ 1992 Wilkmv r/«/ tL r L " C " pcciCs « J unahlui 

reference maps „f S,„„, ; „„,„„, , w „ iy , n „„ „ % " h o £ T™ 

«»...«.. mnw. D,rn,,v,,/„„„ ,/,,,„„/,.„„, human scrx he '' 

l> .nphocyre. and ro0Ukt brain (Cor(J „ c|| „ „, "-" - ^«.tam» 

« «/.. 199,: Junpb ,u, „„,.. ,992. 199,: Carre!, „ „. 

SW T EN« T^ F ' CAT ' OK 6V AMIS '° AC ' D ^"TERMINAL 

When samples from 2-D pels are n0 , unambiguously identified hv 



amino acid 
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c .niposition. p! an<l MW. ofif n the con— i id- r 

. JP rank.ngs of ,he Ik, , Hohohn, Hou^™™™!^ r°T * ^ «* 
'"^ <• <"•• I«95i. Takinc advanu,- of ,h, ok ^ Cordtt IW< 

hned Edman degradation and amino acid an.k- • , df VC,0 P«« a com- 

- Wilkin, a „/.. submmed ». Th,> involve^h ^^1^7" '* ' Pr °* ,n to^auon 
rro<c.n. h> Edman degradation for " or 4 cvcl, ! * CqUe,U * ,n - c »^*DF-NoHcd 
wh.ch the same sample used for ammo add anah s^Ta * T*T" ^ »"= 
•--moved trom the pro.cn. composition is no. M.nir * anMnp ^ 
; ,n «onl> a. mall amount of protein sequence is^r i J' - ''""^ Funh *«"»rc. 
i Jman degradation cycle, can he used ^SinSoTf- IW, ° tt ' rercMm ' ? >'^ 
a«ou 3 c> cJe. ,„ he completed in I h. therehv " w k CURm Pr ° CCdur ^ ^ould 
protetnv p, r week on one automated, mult ^ Z ' of 100 ° r 

M«ion. p, and MW of prote.n. are nij^^^^^^ -„,p ( , 
Viermmal sequence., of he,, match,n £ prc^^Z) ahtn «- a » d 

«o confirm the protc.n ,dem„v tFi w *% tT^\ ,hf sC£ ' uen « *=* 

proteins are N-.erm.nalh blocked', hut « 1. * ' ^ U " iH * 
^eepwhle to the acetyl, formyl. or pvroduia'mv, mn^"^ 1 am,n ° ^ ar < 
«h- may „se,f provid; Uxcfu| f ^^^ «J- — blockage, 

of Vterm.nal sequence .ae and am,no acdrnl!! ' "^"ti Heat, on. A trench 
data generated are quicksand ^T i !^ l ^ 0niim iden ' ifi ™«°n - .ha, 

PROTEIN .UEST,r,CA710N B V PEPTIDE MASS FINGERPRINTING 
Techniques for the identification of proteins hv n*n,i^ 

recently hecn described ,Hen 2 el „ al 9 9 V IS t ^ fin Wriniin P have 
James at.. , 993: Mann. Hojrup and Roepiorffwi- v''^ ^ 1 993: 

«/.. I W4: Sutton ,/ „/.. , 995 This invoh es he Il' r!, ' ^ T " ' Mt ™ « 
-nr res,d U c.spe,-ific enzvmes. the det-rn ,n* on of , of fro »' Prmeins ' 

•n f of these ma.es agams, .heoreu ^e^ 

sequence databases As pr0l ,., n , hjVf dtffercn l,no V ttW ' f ro1 ™ 

should pr„du,e charaucr,s„c T.n=erpr,n^ a «"-*qucncev their P c P „dcs 

digests arc reported to produce more enzvme 1 L i " ' " ah,u,l ^- l "" w/««el 
™«e subsequent pept.de n.s, anah " ^ * " ^ ^,h comp,, 
Mcirtt « «/.. I99J,. The cnzvme'of choice fo r R ^"»»-« ,yc; 4 : 

modified -quencinggradei.huto.h.en^^^b^r " ^ -f 

aNo been used .Papp.n. Hojrup and Bta^ iM^^^^'^ 
peptides obtained, it is desirable for Dro.^,n V nm ""'« the number of 

^onds of the pro.e,n are broken, and product 2 Th '\ Cnsur ^ ,haI -« d -»ir.dc 
-"enable to d.geM.on. Surpr.s.n,,, ^ T are „,ore 

hrom.de .meth.on.ne ,pec.f.c, 'fnJl ' ' ™ mClh ° dN suth ^ cv ano . C n 



(Nikodem and Fresco. 1979: Crimm«n< a al loon- v M /w, 

After pr.«em, are directed, pepnde nas^ «i J T "" L ,99: ' 
D«r. anaI> of pepnde mixtures canT^-^ * ■» 

"* h.rher ,ens„n ily anJ crealtfr , oJerance MALDI ^ ^abie because ol 

'James „ „/.. ,99* Mem « «/ h£ plnn^ r"*^ ^""^ froni «* 
more, re.en, modifier .o^^S^ T^"^' 1 ™^ 
difliculue. exper.en.ed ,,,h the -Ived carK 

0rm and N,ann - J ^ V«n„. Roepstorff and o£, £ ^°" 2 " 

•na« ^riiomsin allow, a small fraction of adi« e s lo ! , Irl ■ * SCns,,,vi »> °«" 
lor analyse, and analyse „self is complete „ a ffu m mu eV P '° * U * d 

ma J° r challenge associated with peptide ma« r.»- • "■ • . 

prior to compu.er match.ne « ai „« ?^^ h ^? n | m,n S , ^™«I«i a «i««i 
mu, be examined careful.; , 0 detcrm.S ^2.^2^ *** Spectra 
-ere,. * there are often enzyme ^i^^™' ™<* ~~ of 
Ranees pre%em (HenzeI „/.. , W; Mon " 2 ^ P ^ cl ; jnd "'"^matins sub. 
Furthermore, if pro,e,n alkylation and reduction hi !« h"*™^" " »*»■• 
prote.n di S e«ion. peptide sequence coverao/Zv k unuenakc " Prior to 

ma^e, present repre^un.Ll^^^J^^ 'T ta 70< ~' »■ >ome 
« M on 2 n «/.. 1 904 , For eukarvote.s. a^^™*?** P ™ m in « h < lutein 
b> the present of postradiational mo^^ 

unmodified pept.de alone can be ^ % ^ ° f « hf 

ca„on< introduced, by electrophoresis an acr^am ^!->"° an,fa ""»' "»«lifi, 

oxidation of methionine, are also knou, io«dte?»^?f '° and thc 

He.« «,//.. 199^1 JHerpepiide masses, lcMairen«/ 199V 
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A number of computer programs are available for mairhinr • i 
d«*» tr t vi 5 w e d in Cm*. .994, M^S?^^"^*™ 
.ive manner, whereh; peak* of mu, 500^000 0, ireleCd*^ Z ?f 

numb:, of m,,sed enzynKclnvayesaJloued ,He„z,| „„,.. W:ik££ft£ 
Rasmu S «n « ,,/.. 1 904 ,. The come, protein idemirv i< (he pro.™ whrt, k- .k' 
peptide ma«es in common u iih die unknown anj, i.ul. . i J """ 

« .995. w as ,„ ?CT „,. , Wl ^ ; ^d-n 



MASS SPECTROMETRY SEQUENCE TAGGING * 

An extenston of peptide boss, fingerprinting has recemlv been dewrihed call,,, 
pepudt sequence taking (Mann and Wilm. 1904. M imn I99S, tk. 
mass spectrometry ,MS/MS,,o ini.tally detente ihe^s^; ^"ta 
them to fragmentation by collision with a eas and finall, d»„™ ! . ' 

vequence.TWragmentationmas.sesofp ep,id«eanrur ***E%Z£Z£l 

mass data generated and the h,*h It™. o7«ne„7s f ~ 'he complex ol the 
Seve„he, r . „ represents a useful ^I^^SX^*"' 
increases the power of peptide mas, fingerprinting protein identito,' ,„ * '> 

Cross-species protein identification 

Protein sequence databases continue to <rou ji a ramrf r „. 

.« be Coned and sequenced. Thts is pantculartv trTe " 0 t ^Z' £ ^ 
enzymes involved in slvcolvsis DNa m,„,„..i . CK "P ,n ? proteins, such 
whtch are htgh, conned acr 0 :; s^ho ^ ^iT utr^'T 
identified across specte.s boundarie, « then becom mc to of "1^''"°' * 
characterisation and DNA sequencing effons. P™"'" 



A) 



fri'tress «•///, pm,c,m,c ,„oje as 





E 


.4 


ClX: 


19.3 


Scr: 


Zly : 


4 


• * 




4.3 


Ala: 






9 




€.7 


VaI: 


Il« : 


c. 


0 


leu: 


15.5 


Phe 



£.3 
6.0 
S.5 
2.5 



His: 
Pro: 

l*s: 



1.3 

1.3 
0.1 



Pi Rar.?e: no ran?* sp«ti*ied 
Ra.-^e: ic rir.y aperifaed 



•* are: 



Rank 


Scrrt 






I 


Hw) 


titst 


■ tstt 








«««««»««»! 






apai.humak 


c 


.27 


28078 


2 
3 


4 


APAl.MArFA 

apai.rabit 


5 

c 


.43 

.15 


28005 
27B3€ 


< 

5 




A?x:.Br/iK 


c 


.3fi 


2?S4t 


14 




5 


.10 


274€7 


£ 


16 


apai.ksvse 




. 42 




8 


25 


APAZ.PIC 




19 


27598 




a?ai_cxi=j: 


e ^ 


26 


279€6 


9 






e 


44 


1177*2 




35 


APA4.JTJMAN 




18 


43374 



B) 

.neayer.t; Trypsin ww iil-er: 10 % 
Scan us — frayaer.r mws c f: 



1553 1923 
1301 Z.2S3 
1231 ?96 



:?31 1€13 1401 1387 
12S2 1235 1231 1215 
B73 831 B13 781 



*«»»»«»««»«« 

**^IP0PRC7T2H A- 2. 
**O*XroPlt0TEZK A-l. 
APO^XPOPROTtlK A-I. 
*PO^IPOPROTEtK A-I . 
A »^P0WICT£IN A- T _ 
AP012POPRCTI2K a- 2 * 
A^OLIPOPRCTEIK A- 2. 
^"POPKOTEIK A- 2. 

» zsorom. 

APOLIPOPROTKK A- IV. 



• 1 A. 






APAI.JTJMAK 




APAI.MAr-A 


3 


apa:_?apka 


< 


• B41845 




APAI^CA-VTA 


6 


. S309O 


"> 


HS2C.P£A 


6 


S2CT24 


c 


h:v/:3 54 


10 . 





- HOMO SAPIENS 

- MACACA FASCICULARIS 

- PAP 20 KAKADRYAS 

CAI.-2S TA-ILlARrs (DOG) 



er.rries *=a_med « 72C1B 

APCL2 PDPROTZ2K A-2 IAPO-AI) 

APoi:PD PR crr2K a-: iapo-a2>' 

AP=— PDPRrrr2K A-2 (APD-A2) . 
erf B - Treponema denticola 

APciiPDPRrrriK a-2 <a?o-a2> 

KXWI254 pr«n.=ur. tera . « f 

•nuj PR-rrxN. - Escherichia c~i. ins *«°<i«*icie nr y 

rx^r^ . 

^ compared |f Ihj wmt pro , e , n , J ^ d " Sh". . ? kmf Pr '" e,B< "" m N,,h tan 
anJ H«Kh«rai«r.l»9J.ui« f e,h,,n.eL.^r 1" ,A,0u «m« ExPASv server. A PPe | fiinSh 
rro.cn, were optilipoproicin A-I ofd.fferem «T„ R n * 1 ? ,,m '*- StVeB "» «« '»P 10 mauhl 

arH'l.popm.e.n A-I from d.fferen. ipec.eV 09 ' ^ ° f ** ""P ,c » "«'ch.ng prn.e.n* were 
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Rapid cross-species identification of /wnteins r rom : . D r . fcrenee 
undenaken with am.r.o acid composition or pep-.ide mass finsefprintintmSl^ 
. F/cur, 6 1. but these 'echn.ques alone ma> not identify proteins unambiguous* when 
phylosenetic cross-,p c .,es distances are real or analysis data k of poor qualitv t >•■»« 
« <./- 199?: Shaw. 1993: Corduell „ aL 1995,. However, verv hish con/idenV n 
protein identities can be achieved when Ii>ts of besi-matchinc protein* "n-aied'hv 
both techniques are compared (Corduell er aL 1995: Wowser er at' 1995, The 
correct identification i> found when the same protein is rartked'hichlv in list, of h-<- 
matches generated by both techniques. This method has allowed approximated PO 
proteins from the reference map of the mollicutc Spin,pla.xmtt mclliferum r-presen. 
ins approximaiely one quarter of the proieomc. to be confidentlv identified hv 
reference to protein information from other species ,S. Corduell. Personal Communi 
canon J. When cross-species protein identification is to be undenaken it should be 
noted that the molecular weight of a protein type across species ,s usualh hi«-hlv 
conserved, but that protein pi can van by more than 2 units (Corduell „ „/" joV, 
Accurate molecular weight determination by direct mass spectrometry of proteins 
blotted to PVDF (Eckerskorn „ aL 1992, should therefore be a 
parameter tor cross-species protein identification. * 

CHARACTERISATION Or POST-TRAS'SLATION'AL MODIFICATIONS 

Many protein* are modified after translation. Such post-translationa! modifications 
including glycosylate, phosphorylation, and sulfation (see Tabic 6) are usuuIK 
necessary for protein function or stability. Some abnormal modifications are associ 
ated with disease (Duthel and Revol. 1993: Ghosh er aL 1993: Yamashita a «/ 
1 993 j. In proteom; siud.es. post-translational modifications can be examined on all 
proteins present, or on individual spots. Studies on all proteins provide an indication 
of which proteins may carry a certain type of modification. For example ^-D -cl 
anulys.s 0 f cell cultures grown .n the presence of f'H] mannose or |"P1 phosphate 
give, an indication of u h.ch prote.ns earn- glycan* contain.nc mannose. and which 
prote.nsare phosphorylatedt Carrels and Franza. 1989). Leciin hindmc studies of 
ccK blotted to PVDF or n.trocellulose provide information on the saccharides, if anv 
that are carried by proteins present (Gravel ei a!.. 1994). * " ' 

When indn .duui protein* of interest earn ing ppst-translai.onjl modif.cai.ons have' 
hecn found, nwcropreparaiive 2-D electrophoresis can be uxid to purifv them m 
microgram quantum iHanash ci aL 1991: Bjellqvist cr 1993h. If protein 
inform, of similar MW und pi are .0 be stud.ed. focus.nc u„h narrow J^e n| 
.Gradients ( I pH un.n can prov.de greater separation and resolution. After electro- 

rn ?7?; ,h ,lL?% an i d , d " r " ° f Pr ° ,C,n P ho ^ hor > »-"'on "n he inve«i Ba ,ed iMunhv 
and Iqbal 991: Gold er aL 1994,. monosaccharide composition can be determined 
ei.zhandler e, aL 1993: Packer e, aL 1995,. and the structure and exact™ of 
clvcoamino acid> can be investigated b> cither Edman dccradat.on based tcchnmues 
or by mass spirometry « Pisano et uL 1 993: Hubeny „ U L 1 993: Carr. Huddles.on 
and Bean. 1993, U„h further development of rapid techniques, .nvest.cation of 
phosphorylation and monosaccharides by chromatographic or mass speciromctric 

2;"^ *° blCOmt r a rouUne sle P in ** characterisation of post-translational 
modifications of proteins from reference maps. 
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The stattj of proteome projects 

Many technical aspects of proteome research have air*-Hv k- 
^vteu.l.utanovemeu ofthe.tatu.ofprote oLp£^£" d, l *"»* d in *** 
Advance, in proteome protect, wi„ uLvZ?^?^'!^ 
. .iiiatives. ,o enable an identity, am.no acid seqCeo^n ion 0 T 
each pro;ein spot. Table 7 shows een0 me size ? ** a5S,?ned 10 

protein, already deAned for a numfcr of mo STJET 7 ' """^ ° f 
genome sequencinc proerams for£ coliJdl ° r * Mnuna - ^ ,nci,Ci » es that whilst 
Mze of erne other" LomeT, and e<p e l^^™™*^**^ 
compter nucleotides^ mean < lha < 

this. 2-D .eference maps and pro^^^^^y^^^ot 
Plasma sp.. £. coli and J. ^iJSfl, ^nl ^m 6 !, " kC 
Wasinger „ , 995: Vanbogelen '»* 
map, of other organisms will taJce lonser to consuun u C ° mp,Ctc 

S«i« protein .dentification .echn.ques I £ 3oSSeT? "* 
and simple eukaryo.es .o be pan.allyVmed ^^^^^^ 

«i«n, e m aP c for ,nm : mode! nrMnnmv Gtrnnm- S£ *' k r W " ' JCnU,> " n :D 

Species Name 



Escherichia enh 
Sarchntnm\tfK rrtrvtuar 
Om\ fi*trit, tir diu fndrttm 
Atftttttiftpxts tuQitmtc 
Cttcnttt htitwmx cirvtitt\ 
Httnitt <(tptrns 



Haploid 
cenomcsSize 
(million bpi 

O.N-0.8 
4.8 
I.V5 
7n 
70 
80 
2900 



Estimated 
proteome size 
unul proteins I 

•HOC) 
ftoon 

12500 
I40(jf| 
17X00 
6UOO0-R0OU0 



Protein 
cmne< in 
SWISS PROT 

UK) 

3 IN) 

:~n 

352ft 



annotated tin 
2-D Map* 

> I IX) 

> MK) 

> 100 



> MKK) 



^^W^^ZS^"^ How,,,. „ K 
•o u«ie. cons,„uu„ S d» -nousekeepin,' p rae i„ ,B d W v^T fr ° m < 
ProKms cons.Hu.ms » «t lha, o, e specific ,o a eel , ™ p™ , L * re,ni " n *' °' 

« k. supenmposed in ce | dST," P Hlr ! '' SSK : 0fMt ^- 
^re untque to different tissue tvo-, Wh .m/ ? Cl of P ro,e| n<»hat 

"«ful „ p,ov,di„5 ,„ cus to „ uclejc ac , d «, uencj ~ «i« He 
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FLTL'RE DIRECTIOVS OF PROFONifc f-KOJECTS 

This review ha> described recent advance* in ih- «r 

pcwble me «*!,»:„, of eo «Kce Z^fa """^ Th " ta ' m «* 
hecom,„ f .he mchod of cho,cc for ,hc defi^o JS^T ^^5^.,'***, " 

investigation of gene expression therein. cs,,v anJ thtf 

Proieome proiecu are alreadv imoacii-'o nn ,k_ j , 
DNA sequence consuls the delSE "o Z^^^T^ ** 
of different tissues of a single oreanism are of,Z *r . P ' C - lhe P^eomes 
cross-spec.es .denuncat.onof "^ 

from Can dlda alhlcai , hv comparison ^ T^^J ™^" 

organisms thai are pooriv moiecularlv defined a7 PC " up stud,e > on 

proceed at a pace orders of r^Jt^^^^ 

defining the gene and prote.n compleni^rrfT * T in ,crn,s of 

sequencing o Genomes u-i|, D S a'STd ^ 7 aB,n ! S - * f ° r lhc DNA 
novel. ' «oe avoided, -nd emphasis.placed on those found to he 

Just as genome sequencing is not an end in itself n#i,,w 
reference map of an or f an.sm. „ 0 r ,^J^^^'^™ c,i : " D ■«*» 

So wh.ls, an immtdiale aim of OTSt»^r^ e ' ns,o . i ' pro, ' oTO - 

maps, this ..ill lead ,o expression siudiA ,? " l " 0 " ; "" ref "«« 

"^^•ions. The cha^eZ*^ 

^mT^C^^ 

HI were ,„,„all. of no known f u „c„on <0»„™ f&?£^'% m ™ m 
Mud.e. will he an undenakmf JU s, a, formidaole'a ,e"nom ™ t Z ' 
proieome pro.eet. are becomin- bul will .„ , *- nomt s,u °'^ no., and 
of how „,,„ s or ? an„m; ^^"J^- 
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ABSTRACT Analysis of cellular protein patterns by 
computer-aided 2-dimensional gel electrophoresis together 
with recent advances in protein sequence analysis have 
made possible the establishment of comprehensive 
2-dimcnsional gel protein databases that may link pro- 
tein and DNA information and that offer a global ap- 
proach to the study of the cell. Using the integrated ap- 
proach offered by 2-dimensional gel protein databases it 
is now possible to reveal phenctype specific protein (or 
proteins), to microsequence them, to search for homology 
with previously identified proteins, to clone the cDNAs, 
to assign partial protein sequence to genes for which the 
full DNA sequence and the chromosome location is 
known, and to study the regulatory properties and func- 
tion of groups of proteins that are coordinate] y expressed 
in a given biological process. Human 2-dimensional gel 
protein databases are becoming increasingly important in 
view of the concerted effort to map and sequence the en- 
tire genome. Celis, J. E.; Rasmussen, H. H.; Leffers, 

H.; Madsen. P.; Honore, B.; Gesser, B.; Dejgaard, K.; 
Vandekerckhove, J. Human cellular protein patterns and 
their link to genome DNA sequence data: usefulness of 
two-dimensional gel electrophoresis and microsequencine 
FASEB J. 5: 2200-2208; 1991. 
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Proteins synthesized from information contained in the 
DNA orchestrate most cellular functions. The total number 
oi proteins synthesized by a typical human cell is unknown 
although current estimates range from 3000 to 6000. Of 
these, as many as 70% may perform household functions 
and are expected to be shared by all cell types irrespective of 
their origin. There are many different cell types in the hu- 
man body with perhaps 30.000 to- 50,000 proteins expressed 
1 n n f _ the ° r?anism as a whoIe judged from the fact that about 
37c oi the haploid genome correspond to genes. Todav onlv 
a small fraction of the total set of proteins has been identified, 
and little is known about the protein patterns of individual 
cell types or their variation under physiological and abnor- 
mal conditions. 

For the past 15 years, high resolution 2-dimensional gel 
electrophoresis has been the technique of choice to deter- 
mine the protein composition of a given cell type and for 
monitoring changes in gene activity through quantitative 
and qualitative analysis of the thousands of proteins that or- 
chestrate various cellular functions (refs 1-6 and references 



therein). The technique originallv described by OTarrell . 
separates proteins in terms of their isoelectric point ^pl) an. 
molecular weight. Usually one chooses a condition of in- 
terest and the cell reveals the global protein behavioral 
response as all detected protein5 can be analyzed both 
qualitatively and quantitatively in relation to each other. A: 
present, most available 2-dimensional eel techniques ^regu- 
lar gel format) can resolve between 1000 and 2000 proteins 
from a given mammalian cell type, a number that cor- 
responds to about 2 million base pairs of coded DNA. Lev 
abundant proteins can be detected b\ analyzing partial! 
purified cellular fractions. 

Two-dimensional gel ectrophoresis has been widely applied 
to analysis of cellular protein patterns from bacteria to mam- 
malian cells i refs 1-6. and references therein). In spite of 
much work, however, information gathered from these 
studies has not reached the scientific community in its full- 
ness because of lack of standardized eel svstems and the lack 
of means for storing and communicating protein informa- 
tion. Onlv recently because of the development of appropri- 
ate computer software (7-13), has it been possible to scar 
gels, assign numbers to individual proteins, and store the 
wealth of information in quantitative and qualitative com- 
prehensive 2-dimensional gel protein databases (4, 14-23). 
i.e.. those containing information about the various proper- 
ties (physical, chemical, biological biochemical, physiologi- 
cal, genetic, immunological, architectural, etc.) of all the 
proteins that can be detected in a given cell type. Such in- 
tegrated 2-dimensional gel protein^databases offer an easy 
and standardized medium in which to store and communi- 
cate protein information and provide a unique framework in 
which to focus a muhidisciplinarv approach to study the cell. 
Once a protein is identified in the database, all of the infor- 
mation accumulated can be easily retrieved and made availa- 
ble to the researcher. In the long run, protein databases are 
expected to foster a wide variety of biological information 
that may be instrumental to researchers working in many 
areas of biology- among others, cancer and oncogene 
studies, differentiation, development, drug development and 
testing, genetic variation, and diagnosis of genetic and clini- 
cal diseases {Tig. 1 ). 

The approach using systematic 2-dimensional gel protein 
analysis has recently gained a new dimension with the ad- 
vent of techniques to microsequence major proteins recorded 
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Figure 1. Interface between partial protein sequence databases, 
comprehensive 2-dimensional eel databases, and the human ee* 
nome sequencing project. Appropriate software is required to com- 
pare protein and DNA sequences. In general, although the infer- 
ence of a protein's sequence from the DNA sequence (thick arrow ) 
is direct and unambiguous, the DNA sequence can only be inferred 
.approximately from the protein sequence (thin arrow) and cloning 
>f the gene requires either a cDNA or the requisite group of 
jligonucleotide probes deduced from the partial amino acid se- 
quence. Modified from ref 6. 

in the databases (refs 24-42 and references therein). Partial 
protein sequences can be used to search for protein identitv 
as well as to prepare specific DNA probes for cloning as-vet- 
uncharacterized proteins (Fig. 1). As these sequences can be 
stored in the database (see for example Fig. 2H). thev offer 
i unique opportunity to link information on proteins with 
he existing or forthcoming DNA sequence data on the hu- 
man genome (Fig. 1) (20. 36. 39). 

Using the integrated approach offered by comprehensive 
2-dimensional gel databases (Fig. 1), it will be possible to 
identify phenotype-specific proteins: microsequence them 
and store the information in the database: search for homol- 
ogy with previously characterized proteins: clone the 
cDNAs. assign partial protein sequences to genes for which 
the full DNA sequence and the chromosome location are 
known, and study the regulatory properties and function of 
groups of proteins (pathways, organelles, etc.) that are coor- 
dinated expressed in a given biological process. Comprehen- 
sive 2-dimensional gel protein databases will depict an in- 
tegrated picture of the expression levels and properties of the 
thousands of protein components of organelles, pathways, 
and cytoskeletal systems in both physiological and abnormal 
conditions and are expected to lead to identification of new 
regulatory networks in different cell types and organisms. In 
the future. 2-dimensional gel protein databases mav be 
linked to each other as well as to national and international 
specialized databanks on nucleic acid and protein sequences, 
protein structures. NMR experimental data, complex carbo- 
hydrates, etc. 

A few 2-dimensionaJ gel protein databases that are accessible 
in a computer form have been published in extenso: these 
correspond to the protein-gene database of Escherichia coii 
K-12 developed by Neidhardt and colleagues (14. 23). the rat 
REF 52 database established by Carrels and co-workers at 
Cold Spring Harbor (18. 22). and a few human databases 
(transformed amnion cells [15. 20]. normal embryonal lune 
MRC-5 fibroblasts [17. 21]. keratinocytes [19] and peripheral 
blood mononuclear cells [15]) developed in Aarhus. Given 
space limitations and to keep this review in focus, we will 
concentrate on the computerized analysis of human cellular 
2-dimensional gel patterns, and in particular on the steps in- 
volved in establishing comprehensive 2-dimensional gel 
databases that can link protein and DNA information. 



MAKING AND MANAGING A COMPREHE\<IVK 
2-DIMENSIONAL GEL DATABASE OF HLAlYv 
CELLULAR PROTEINS 

The first step in making a comprehensive 2-dimcnsu>na: o . 
protein database is to prepare a synthetic image ■ ciicirn: nv::: 
ol the gel image* of the gel (Huoroeram. Cuomassie bine or sil- 
ver stained gel) to be used as a standard or master referent l-. 
This can be done with laser scanners, charee couple device 
(CCD) : array scanners, television cameras, rotating drum 
scanners, and multiwire chambers iIji. Computerized anal- 
ysis systems for spot detection, quantitation, pattern match- 
ing, and data handling (access and retrieval of information, 
database making) have been described in :he literature 
(ELSIE [43). GELLAB [11]. HERMeS |44j. MELAMF 
[10]. QUEST (9). and TYCHO [8]) and some are available 
commercially (PDQUEST. Protein Database Inc.. Hunting- 
ton. N.Y.: KEPLER, Large Scale Biology. Rorkvillc, Mif : 
Visage. Biolmage Corporation. Ann Arbor. Mich.: Gemini. 
Joyce LoebL Gateshead: Microscan 1000. Tcchnologv 
Resources Inc.. Nashville. Tenn. and MasterScan, Billcrica. 
Mass.). Unfortunately, most of these systems are incompati- 
ble with one another and ihcir advantages and disadvantages 
have been discussed by Miller (13V 

In our work station in Aarhus. "fiuorogrnms are scanned 
with a Molecular Dynamics laser scanner and the data are 
analyzed using the PDQUEST II software (Protein Data- 
bases Inc.) (12) running on a spark station computer 4100 
FC-8-P3 from SUN Microsystems. Inc. The scanner meas- 
ures intensity in the range of 0-2.0 absorbance. A tvpical 
scan of a 17 x 17 cm fluorogram takes about 2 min. Steps 
in image analysis include: initial smoothing, background 
substraction, final smoothing, spot detection, and fitting of 
ideal Gaussian distribution to spot centers. Spot intensity is 
calculated as the integration of a fitted Gaussian. If calibra- 
tion strips containing individual segments of a known 
amount of radioactivity are used, it is possible to merge mul- 
tiple exposures of the sample image into a single data image 
of greater dynamic range. Once the synthetic image is 
created it can be stored on disk and displaved directly on the 
monitor. Functions that can be used to edit the images in- 
clude: cancel (for example, to erase scratches that mav have 
been interpreted as spots by the computer: cancel streaks or 
low dpm spots), combine (sometimes a spot mav be resolved 
into several closely packed spots), restore, uncombine. and 
add spot to the gel. The process is time consuming- about 
1-1/2 day per image. Edited standard images can be matched 
to other svnthetic images. Figure 2A shows a portion of a 
standard synthetic image (IEF) of a rluorogram of 
[ 35 S]methionine labeled cellular proteins from human AMA 
cells (master database) (20V Images can be displaved either 
in black and white (resembling the original Huorograms) nr 
in color pother images in Fig. 2). depending on the need. As 
shown in Fig. 2B, each polypeptide is assigned a number by 
the computer, which facilitates the entry and retrieval of 
qualitative and quantitative information for any given spot 
in the gel (20). The standard image can be matched auto- 
matically by the computer to other standard or reference gels 
(Fig. 2C. matching of AMA cellular proteins [left] to MRC-5 
proteins [right]) provided a few landmark spots are given 
manually as reference ( indicated with a * in Fit;. 2C) to in- 
itiate the process. 



^Abbreviations: CCD. charee couple device: PCNA. proliferat- 
ing cell nuclear antigen: HPLC. high performance, liquid chromaioe/ 
raohv. 



Figure 2. .-h Synthetic image of a fract.on of an IEF cd of the master „n;icc oi AM A cdiular protein* A, a, ■ „ 1 h,„ «h ■ i 
assigned to each sdoi nCnmnancnn n f A\i i /i.r, , . . . . , n tuiuiar proturu. /i i a> m A but shnuini: numbers 
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The automatic matching process that has been described 
in detail by Garrels et al. (1ft) takes about 5 min. Matched 
proteins are indicated with trie same letters in both gels (Fig. 
2C). The usefulness of this function is emphasized by the fact 
that data accumulated on common household* proteins can 
be easily transferred to any other human cellular cell type 
whose 2-dimensional gel ceilular protein pattern is matched 



to our standard AMA 2-dimensional gel protein image. Al- 
ternatively, if the standard gel is part of a matchset (set f 
gels in a given experiment) it can be used as a linker gel t 
compare, for example, the quantitative values of a given pro- 
tein throughout the experiment (see Fig. 2D; levels of some 
proteins in normal and SV40 transformed human MRC-5 
fibroblasts) or with other standard images in different sets of 



cross-matched experiments (18, 22). 

Once a standard map of a given pr tein sample is made 
one can enter qualitative annotations to make a reference 
database. Our master 2-dimensionaJ gel database of trans- 
formed human amnion cell (AMA) proteins (20) lists 3430 
polypeptides of which 2592 correspond to cellular compo- 
nents, having pi's ranging from 4 to 13 and molecular 
weights between 8.5 and 230 kDa. The most abundant pro- 
teins m the database correspond to t tal actin (3.87% of total 
protein; about 90 million molecules per cell) while the 
lesser abundant of the recorded polypeptides are present in 
the vicinity of 5000 molecules per cell. Some annotation 
categories we are using to establish the master AMA data- 
base include: 1) protein identification (emigration with 
purified proteins, 2-dimensional immunoblotting, microse- 
quencing); 2) amounts (total amounts and levels of synthe- 
sis); 3) subcellular localization (nuclear, cytoskeletal, mem- 
brane, membrane receptors, specific organelles, etc.)- 4) 
antibodies; 5) postradiational modifications (phosphoryla- 
tion, glyesylation, methylation etc.); 6) microsequencing- 7) 
cell cycle specificity (specific variations in levels of synthesis 
and amount); 8) regulatory behavior (effect of hormones 
growth factors, heat shock, etc.) 9) rate of synthesis in nor- 
mal and transformed cells (proliferation sensitive orotein. 
ce.. cycle specific proteins, oncogenes, components of the 
pathway (or pathways) that control cell proliferation)- 10) 
function (mainly from emigration with proteins of known 
Junction); 11) sets of proteins that are coordinately regulated 
(hierarchy of controls, differential gene expression in various 
cells etc.); 12) cDNAs (cloned cDNAs); 13) proteins that are 
specific to a given disease (systematic comparison of protein 
patterns of fibroblast proteins from healthy and diseased in- 

r^M* , ) k 7 ^ ) l expression and «Plo«ation of transfected 
cDNAs; IS) pathways (metabolic, others); 16) gene localization 
(genetic and physical); 17) effect of microinjected antibody 
on patterns of protein synthesis; and 18) secreted proteins 
Information entered for any spot in a given annotation 
category can be easily retrieved by asking the computer to 
display the information on the color screen. For example 

a i^i a a v !? nthe,ic ima S e of a NEPHGE gel (master 

AMA database) displaying the information contained under 
the entry glycolytic pathway. Alternatively, one can use the 
function peruie annotations for spot to directly ask the com- 
puter to list all the entries available for a particular protein. 
By clicking the mouse in a given entry (in this case, presence 
in fetal human tissues) it is possible to take a quick look at 
the information in that particular entry (Fig. 2F) 

A major obstacle encountered in building comprehensive 
2-dimensional gel protein databases is identifying the large 
number of proteins separated by this technology. In our 
databases (20, 21), known proteins are identified by one or 
a combination of the following procedures: 1) emigration 
with known proteins, 2) 2-dimensional gel immunoblotting 
"sing specific antibodies, and 3) microsequencing of 
Coomassie Bnllant Blue stained human proteins recovered 
ZZ? 2 - di ™nsional gels (see next section). Protein 
.dentification by means of microsequencing may be difficult, 
as individual protein members of families with short peptide 

tZ rC ?p $ ff 3pe dc,cction - I" ^e gene-protein data- 
base of £. coh « - 2 (14, 23), another major 2-dimensional gel 
database available at present, proteins are being identified by 
a wider range of tests that include emigration with purified 
proteins; genetic criterion (deletion, insertion, frameshift 
nonsense, m.ssense, regulatory), plasmid-bearing strains 
and in vitro synthesis of protein; selective labeling (methyla- 
"on, phosphorylation); peptide map similarity; and physio- 
logical criterion and selective derivatization 



So far we have received nearly 550 antibodies from bbo- 
i ' JET ,he world a " d ">«e ^e being svstemi" a '-K 
csted by 2^,mensional gel immunoblottingfor an^ n d 

S ' miJar, > ; P urified P^^ns " and orinelt 
provided by several laboratories have greatly aided identic 
uon of unknown proteins (20721). We routinely request am,- 
bodies and protein samples and promise the donors to make 
available all the information we mav haw accumulated on that 
particular protein. For example, fable 1 lists entries availa- 
ble for Lipocort.n V (IEF SSP 8216). also known as annexin 
. ' VA Y* endonexin II. renoconin. chromobindin-5' an- 
ticoagulant protein. PAP-I, ycalcimedin. IBC. calphobindin 
and anchorin CII. 

As mentioned previously, one distinct advantage of 
.-dimensional gel electrophoresis is the possibility of «udv- 
ing quantitative variations in cellular protein patterns that 
ma> lead to identification of groups of proteins that are ex- 
pressed coordinately during a given biological process 
Quantitation however, is not an easy task as reflected bv the 
lack of published data on global cellular protein patterns. We 
believe this is partly due to difficulties in obtaining sets of 
gels that are suitable for computer analysis (streaking 
material remaining at the origin, etc.) as well m Hmita^ 
tions (.aborious editing time, need of calibration strips to 
merge images, limited dynamic range, etc.) in the computer 
analysis systems available at the moment. Perhaps the most 
advanced quantitative studies published so far using com- 
puter analysis have been carried out by Carrels and co- 
workers (18, 22). In particular, these investigators have estab- 
lished a quantitative rat protein database (18, 22) designed 
to study growth control (proliferation, growth inhibitors and 
stimulation) and transformation in well-defined groups of 
cwVn"", 0 amed by transf °rmation of rat REF52 cells with 
SV40, adenovirus, and the Kirsten murine sarcoma virus 
These studies have revealed clusters of proteins induced or 
repressed during growth to confluence as well as groups of 
transformation-sensitive proteins that respond in a differen- 
tial fashion to transformation by DNA and RNA viruses A 
most interesting feature of this quantitative database is the 
discovery of a group of coregulated proteins that show simi- 
lar expression patterns as the cell cvcle-regulated DNA repli- 

?PrNAU 0te , m iTfT 35 P roliferatin S ce » ""clear antigen 
(PCNA)/cyclin (45). s 

In our human databases, most quantitations have been 
carried out by estimating the radioactivity contained in the 
polypeptides by direct counting of the gel pieces in a scintil- 
lation counter (20, 21). Up to 700 proteins can be cu, out 
through appropriate exposed films in a period of time com- 
parable to that required for editing a synthetic image. 
Manual quantitation of this large number of spots is difficult 
without the assistance of a master reference image and a 
numbering system that can be used to identify the spots Us- 
ing this approach, we have recorded quantitative changes in 
the relative abundance of 592 [»S]methionine-labeled pro- 
teins synthesized by quiescent, proliferating, and SV40 
transformed human embryonic lung MRC-5 fibroblasts (21) 
Some data concerning cytoskeletal and cvtoskeletal-related 
proteins are presented in Fig. 2G. Our studies as well as 
those of Garrels and co-workers (18, 22) may in the long run 
help define patterns of gene expression that are characteristic 
ol the transformed state. 

OTHER 2-DIMENSIONAL GEL PROTEIN 
DATABASES 



As mentioned previously there are other 2-dimensional gel 
databases available in computer form that have been pub- 
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TABLE 1 . Some entries Jot lipoeortin V ir. the human AM A 2 'dimensional gel protein database 



Knirtt-i lor hpocomn V d£F SSP S2l6; 



Iniormauon entered 



1 . Protein name 



Percentage of total protein 



Lipoconin V. renoconin. chromobindin-5'. endonexin I. anticoacuian: prou-:::. 
PAP-I. \AC-q. 35-7-calcimedin. IBC. calphobindin I. anchorin CII. annexe V 
0. 1107c (about 2.800.000 molecules per celH 



.'i. Apparent molecular weight (mri 

4. Isoelectric point ipl) 

V Method (or methods) of identification 

6. Credit to investigators that aided in 

identification 

7. Antibody against protein 

H. Comigration with human proteins 
9, Cellular localization 

10. Calcium/phospholipid-dcpendcnt 

membrane proteins 

11. Function 



33.3 kDa 
4.76 

Microsequencing. 2-dimensionaJ immunoblotting. Comicration 
G. Bauw. J. Vandekerckhove. and colleaeues. Riiksunix ersueit Gent: 



B. P. 



BIOGEN. Cambridge; N.G. Ahn. University of Washington 
Polyclonal (rabbit, antibody no. 20). B. Pcpinsky. BIOGEN. Cambridge 
Lipoconin V.N.G. Ahn. Howard Hughes Medical Institute. Washington Uiumtmiv 
Subcortical membrane 
Lipoconin \" 



Regulation of various aspects of inflammation. : 
and differentiation 



mrnunc icsponn-. binm! t oautu.m. »r, 



12. Partial amino acid sequence 

1 > e D X A s eq u e n ee 

14. Levels in ietal human tissues 



13. Levels in quiescent, profile rat inc. and 
transformed MRC-5 fibroblasts 

lo. Distribution in Triton supernatant and 
c\ toskeletons 



GTVTDFPGFDER <7-l8V VLTEIIASR « 109-1 17V QVYEEEYOSSLEDOVVC; 
( 127-143). ?GTDEEKFITIFGT(R) (187-201) 

Known. R. Blake et al..y Biol. Chrm. 263. 10799-10811: jyKS 
(pi = 4.76 irom translated sequence) 

Adrenal elands ; brain = ; 

cerebellum = * - - ; ear - - - - ; eve = * - » • 

heart = - - - : hvpophvsis - * * * ; liver * * * - ; 

lung = - - * ; meninges - - * - : 

mesonephnc tissue = ♦ * * ; 

striated muscle = * * - : pancreas = - * * ; 

skin * ; spleen = . * . : stomach = ; 

submandibular gland - * * • ; 
small intestine = - * » ; thvmus = - - - ; 
thyroid gland - * * - ; tontjue = - * * ; 
ureter = - » - 



Qtquiescent) = 1.1; P (proliferating) 
T (SV40 transformed) = 0.3 

Mainly supernatant 



= 1.0: 



lished in extenso: these correspond to the E. coli K-12 
proiein-gcne database (14. 23) and to the rat REF52 data- 
base (18. 22). 

The £. coli K-12 cellular protein-gene database is perhaps 
the most complete of ail databases reported so far and even- 
tually it should trace each protein back to its structural gene. 
Information contained in this database includes: gene/pro- 
icin name (protein name. EC number, gene name): 
2-dimensional gel spot designations (x-y coordinates from 
reference gels, alphanumeric designation); genetic informa- 
tion (linkage map location, physical map location. Genebank 
rode, sequence reference, location on Kohara clones): bi- 
ochemical information (molecular weight, pi. number of 
residues of each amino acid, mole percent of each amino 
acid, total number of amino acids in a polypeptide), and 
regulatory information (cellular level of protein in different 
media and different temperature, member of regutbn. mem- 
ber of stimulon). Major advances of this database are en- 
visaged in the future in view of the eminent sequencing of 



the whole £. coli genome as well as the development of im- 
proved methods to express cloned genes. 

The rat REF52 2-dimensional gel protein database lists 
about 1600 proteins that have been recorded using the 
QUEST analysis system (18, 22). Included in this quantita- 
tive database are J) protein names (cyioskeletal and heat 
shock proteins as well as various nuclear, mitochondrial, and 
cytoplasmic proteins), 2) annotations (subcellular localiza- 
tion, modification, recognition by specific antibodies, 
coprecipitation. NH r terminal sequence, cross-reference to 
protein sequence information and references to the litera- 
ture). J) protein sets (cytoskeletal proteins, phosphoproteins. 
sets of proteins with PCNA/cyclin-likc properties, etc.) and 
4) general quantitative data (protein synthesis during growth 
of normal REF52 cells to confluence and quiescence, and af- 
ter restimulation of growth-inhibited cells). 

In addition to the 2-dimensional gel databases mentioned 
so far there are several smaller cellular databases being es- 
tablished in human (normal human diploid fibroblasts, lvm- 



phocytes. leukocytes. leukemic cells) mouse (XIH/3T3 cells, 
T lymphocytes). Aplysia. yeast (Saccharomvces eercvisae), plants 
(wheat, barley, sorghum), and Euglena. Databases of tissue 
protein, (brain, whole mouse, liver) and body fluid proteins 
(plasma proteins, cerebrospinal fluid, urine, and milk) are 
being established in several laboratories. The reader is 
directed to the review by Celis et al. (4) for details and refer- 
ences concerning these databases. 



MICROSEQUENCIXG HAS ADDED A NEW 
DIMENSION TO COMPREHENSIVE 
2-DIMENSIONAL GEL DATABASES: A DIRECT 
LINK BETWEEN PROTEINS AND GENES 

The development of highly sensitive amino acid gas-phase or 
liquid-phase sequenators (24), together with the establish- 
ment of efficient protein and peptide sample preparation 
methods, has opened the possibility to perform a systematic 
sequence analysis of proteins resolved by 2-dimensional gel 
electrophoresis. Indeed, generated pieces of protein se- 
quences can be used to search for protein identity (compari- 
son with available sequences stored in databanks) as well as 
for preparing specific DNA probes for cioning of as yet un- 
characterized proteins (Fig. 1). In addition, partial protein 
sequences can be stored in 2-dimensional gel databases (for 
example, see Fig. 2H) and offer a unique link between pro- 
teins and genes (Fig. 1). 

In the early 1970s gel electrophoresis was used to purify 
proteins for sequencing purposes (reviewed by Weber and 
Osborn in ref 25). Proteins were recovered by diffusion and 
sequenced by the manual dansyl-Edman degradation at the 
nanomole level. This technique was further refined by using 
electro-elution to recover proteins and by miniaturizing the 
system (26). This method has been used extensively, but 
showed increasing drawbacks (low yields, protein samples 
contaminated by free amino acids, and NH 3 -terminal block- 
ing) as the amounts of handled protein gradually became 
smaller (e.g., at the 10 picomol level). 

Most of the problems referred to above have been 
minimized with the introduction of protein-electroblotting 
procedures (27-32). When proteins are blotted on chemi- 
cally inert membranes, it is possible to sequence the immobi- 
lized proteins directly without additional manipulations. 
Thus, depending on the amount of bound protein and its na- 
ture, this direct sequencing procedure generally yields NH 2 - 
terminal sequences containing 10-40 residues. As such, this 
technique was used to identify, by their NH 3 -terminal se- 
quences, differentially expressed major proteins from total 
cellular extracts separated on 2-dimensional gels. A major 
difficulty encountered in this procedure is the occurrence of 
frequent anefactual blockage of the proteins. Several studies 
suggest that this phenomenon is mainly due to reaction with 
contaminants (particularly unpolymerized acrylamide 
present in the gel) and to a high dilution of the protein (low 
concentration of the protein per unit membrane surface). In 
addition to this primarily technical problem, many proteins 
are blocked in vivo by acylation or by a pyrrolidon carboxylic 
acid cap. 

The problem of partial or complete NH r terminal block- 
age can be circumvented by generating internal amino acid 
sequences. This is achieved by fragmenting the protein 
present in the gel (gel in situ cleavage) or by cleaving it while 
bound to the membrane (membrane in situ cleavage) 
(33-35). In both cases, proteins are either cleaved in a res- 
tricted way (e.g., by limited enzymatic digestion or by using 
restriction chemical cleavage conditions) or fragmented into 
smaller peptides. 
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Of -the different combinations examined, we had - H , ; 
results by using exhaustive proteolytic dicestior/ 0 1" 
membrane-immobilized proteins. This method has br~- 
described for Ponceau red-stained proteins on nitroceiiuii.V.' 
blots (34), for Amido-black^tained Immobilon-bounc pr . 
teins. and for fluorescamine^detected proteins on glass rib-, 
membranes (35). The proteases used (trypsin, chymotrvpsn. 
or pepsin) cleave at multiple sites, generating small peptide 
that elute from the blot into the digestion buffer from which 
they are purified by reversed-phase hieh performance liquid 
chromatography (HPLC) before being sequenced individu- 
ally Although each of these manipulations could be expected 
to result in a reduced yield of final sequence information, we 
were surprised thai the peptides could be sequenced with 
high efficiency In our hands, this approach could be rou- 
tinely applied to gel-purified proteins available in amount* 
ranging from 5 to 10 /ig. and often vielded sequence informa- 
tion covering more than 307r of the total protein. As 
membrane-immobilized proteins are not homogeneously 
digested, but rather show protease sensitivitv next to resis- 
tant regions, the number of peptides generated is much lower 
than expected from the number of potential cleavage sites. 
Consequently. HPLC peptide ehromaiograms arc less com- 
plex and most peptides can be recovered in pure form. 

As only limited amounts of -a protein mixture can be 
loaded on a 2-dimensional gel. proteins of interest are often 
obtained in yields insufficient for the currently available se- 
quencing technology. More material can be obtained bv en- 
riching for a certain subcellular fraction (purified cell or- 
ganelles) or by exploiting affinity (dyes, metals, drugs, etc) or 
hydrophobic properties of proteins before gel analysis. All of 
the sequencing results accumulated so far in the human pro- 
tein database (20) (a few are shown in Fig. 2H) have been 
obtained from analysis of protein spots collected from 
2-dimensional gels that had been stained with Coomassie 
blue according to standard procedures and dried for storage. 
Proteins are recovered from the collected gel pieces by a 
protein-elution-concentration device, combined with gel 
electrophoresis and electroblotting. Details of this technique 
have been reported in a previous communication (42) and a 
brief outline is given below. 

Combined gel pieces are allowed to swell in gel sample 
buffer (a total volume of 1.5 ml). The gel pieces combined 
with the supernatant are then collected into a large slot made 
in a new gel. The slot is further filled with Sephadex G-10 
equilibrated in gel sample buffer. During consecutive gel 
electrophoresis, most of the electrical current passes on the 
side of the slot instead of passing through the slot. This 
results in both a vertical stacking and horizontal contraction 
of the protein band. With this device the protein is efficiently 
eluted from the gel pieces and concentrated from a large 
volume into a narrow spot. The highly concentrated (about 
5 mm 2 ) protein spot is then electroblotted on PVDF- 
membranes. stained with Amido black, and in situ digested 
with trypsin. The peptides generated during digestion elute 
from the membrane into the supernatant, and can be sepa- 
rated by narrow bore reversed-phase HPLC and collected in- 
dividually, for sequence analysis. 

Using this and previous procedures (37, 39, 42), we have 
so far analyzed 70 protein spots collected from 
2-dimensional gels (20, and unpublished observations) (see 
for example Fig. 2H). The sequence information amounts to 
2100 allocated residues corresponding to an average of 30 
residues per protein spot. So far we have made cDNAs of 
many of the unknown proteins that have been microse- 
quenced. and a substantial number has been cloned and se- 
quenced. AH available information indicates that it may be 
possible to obtain partial sequence information fr m most of 
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the proteins that can be visualized by Coomassie Brillant 
Blue staining. 

Partial protein sequences are stored in the database as dis- 
played in Fig. 2H. and it should be possible in the near fu- 
ture to interface this information with forthcoming D.\A se- 
quence data from the human genome project. In the long 
run. as the human genome sequences become available it 
will be possible to assign partial protein sequences to eenes 
:or which the full DNA sequence and chromosomal location 
:ii*e known (Fig. 1). 

SUMMARY 

The studies presented in this brief review are intended to 
demonstrate the usefulness of computer-aided 2-dimensional 
gel electrophoresis and microsequencing to analyze cellular 
protein patterns, and to link protein and DNA information. 
As more information is gathered worldwide, comprehensive 
latabases will depict an integrated picture of the expression 
levels and properties of the thousands of proteins that orches- 
trate most cellular functions. 

Clearly, databases allow easy access to a large bcdv of data 
and provide an efficient medium to communicate stan- 
dardized protein information. In the future, databases will 
foster a wide variety of biological information that can be 
used to support collaborative research projects in basic and 
applied biology as well as in clinical research (2. 5. 46). Once 
a protein is identified in a particular database all the infor- 
nation gathered on it can be made available to the scientist. 
However, many problems must be solved before protein 
databases become of general use to the scientific community. 
A most urgent one is to promote standardization of the gel 
running conditions so that data produced in a given labora- 
tory may be used worldwide. Surprisingly, the gel running 
technology as it stands today is still a craftmanship art. 

Finally, comprehensive, computerized databases of pro- 
teins, together with recently developed techniques to 
microsequence proteins, offer a new dimension to the studv 
of genome organization and function (Fig. 1). In particular, 
human protein databases may become increasingly impor- 
tant in view of the concerted effort to map and sequence the 
entire human genome. This formidable task is expected to 
dominate biological research in the next decades. [fjj 

We would like to thank S. Himmelstrup Jorgensen for typing the 
manuscript and O. Sonderskov for photography. Work in the 
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Nonenzymatic extraction of cells from clinical tumor 
material for analysis of gene expression bv two- 
dimensional polyacrylamide gel electrophoresis 
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1 Introduction 

Tumors may develop by a number of different mechan- 
isms in any given cell type. At the time of diagnosis 
tumors w,|| have progressed along different pathways to* 
various stages of malignancy. To provide a basis for'indi- 
vidua! therapy it is of importance to examine specific 
properties of the tumor cell population in each patient 
A large number of different markers have been de- 
scribed in order to increase the diagnostic accuracy It is 
likely that a combination of serveral markers is needed 
m the future in order to reflect different properties of 
the tumor. One important method for the resolution of a 
large number of potential markers is two-dimensional 
electrophoresis (2-DE). Extensive efforts are being made 
m identifying various polypeptides separated bv 2-DE 
and to characterize how the expression of these polvpep- 
•des is affected by the response to cellular transforma- 
tion and various culture conditions [1.2J. It would be of 
value to transfer this information to 2-DE separations of 
polypeptides from tumor tissue samples. However one 
prerequisite is that the quality of the 2-DE gels "from 
tumor samples is comparable in qualitv with 2-DE eels 
Tom samples of cultured cells. 

Frozen tumor tissues are commonly used for various bio- 
chem.cal assessments. However, if such samples are ana- 
>zed by 2-D polyacrylamide gel electrophoresis (PAGE) 
the polypeptide patterns are obscured by contamination 

l ?.! n ', an V 0nneclive tissue Proteins. Such nontu- 
mor-cell-related variations represent serious problems in 
me interpretation and inter-patient comparison of 2-DE 

D e T n T itntt .\ Dt B ° Franzen - Div,l,on ol ' Tumor Pamoiogy. 

^ZSJ^TS^ 1 - Karo,,MkJ Ho ' p " al and 

^r«lr i0 rFP 2 " DE ; T * 0 - dimen " onal Polyacrylam.de ( el electro- 
\ P 7n * l L ,0e ' eC,r,C ,OCU$,ng: LDH - ,a « J,c Oehvdrcenwe- 

«une ,? P "* 0: PBS - Ph0SPha,e bu,Tcre<1 " line: p CNA. prolife. 
Wine cell nuclear am. S en: PIH. precise .nh.b.tors: PMSF. phenyl- 
methyl iullony, Muor.de: SDS. >od,um dodecy. sulfate: WW. w e 



patterns f^J. 2-DE patterns of cells prepared from fresh 
tumor materia] were analyzed after enzymatic extraction 
of tumor cells {4. :>] or after culturing tumor fragments in 
medium containing radioactive amino acids 16] These 
procedures may. however, lead to alterations in the gene 
expression/polypeptide patterns. We are onlv aware of 
one study where nonenzymatic extraction of cells from 
tresh tumor tissue (prostate cancer) was used to prepare 
samples for 2-D PAGE |4). We have examined enzvmatic 
extraction and various nonenzymatic preparation tech- 
niques including fine needle aspiration, for the prepara- 
tion of cells from fresh tumor tissues. We describe 
nonenzymatic extraction procedures that are rapid, lead 
to high-quality 2-DE patterns, and that alleviate the 
necessity to purify tumor cell populations from dead 
cells. 

2 Materials and methods 

2.1 Cell cultures and samples used for spot 
identification 

A rat embryonal fibroblast cell line. WT2 (a kind gift 
from Dr. J 1. Garrels and Dr. S. Pattersson) was used f r 
the identification of a number of heat shock and struc- 
tural proteins. Human normal diploid lung fibroblasts, 
w 38. human epithelial breast carcinoma cells. MDA- 
231 and MCF-7 were purchased from ATCC and grown 
as recommended. Polypeptides prepared from a leu- 
kemia type pre-B-ALL were separated by 2-DE The 
2-DE map was then analyzed by Dr. S. M. Hanash (Uni- 
versity of Michigan. Ann Arbor. USA). 
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2.2 Tumor tissues samples 

In this study. 2-DE maps from seven turn rs were used 
as representative illustrations: two aden carcinoma of 
the lung (LA. and LB. mucinous, both cases interme- 
diate grade of differentiation), one sqamous carcin ma 
of the lung (LS). one carcinoid-like breast cancer (BC) 
one microfollicular adenoma (highlv differentiated) f 
the thyroid (TA). one highly differentiated hyperneph- 
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roma. a lumor of the kidney fKHh and finally one case 
of poorly differentiated corpus carcinoma (CP). 

2J Preparation of cultured cells 

The cell monolayers were washed twice in phosphate 
buffered saline fPBS) and then scraped off in ice-cold 
PBS including pr tease inhibitors (PIH). phenylmethyl- 
sulfonyl fluoride (PMSF) 0.2 idm and 0.83 mM bcnzami- 
dine pelleted at 660 X g. 3 min (+4°C) and washed one 
time before final centrifugation at 2700 X g. 5 min. The 
wet weight of the cell pellet was recorded and the cells 
were stored at -80°C until further processing. 

2.4 Preparation of tumor tissue samples 

2.4.1 General remarks 

Macroscopically representative and non-necroiic tumor 
tissues were selected within 20 min after resection. 
Parallel samples were routinely prepared for cytology. 
The samples were processed as rapidly as possible on ice 
or at +4"C and in the presence of PIK. Cciis were 
stained with DifTQuick ( Baxter » and usually examined at 
three different occasions during the preparation proce- 
dure: (ij cytology sample, (ii) extracted cells and (Hi) 
cells after percoll gradient centrifugation. 

2.4.2 Specimen acquisition 

The strategy of sample preparation is shown in Fig. 1. 
Tumor tissue eel! samples were usually obtained by fine 
needle aspiration (NA) using a 0.7 mm needle! The 
syringe was filled with 1-2 mL of ice-cold culture med- 
ium/PIH. We found that if a tumor appeared to be very 
fibrous it is difficult to extract enough cells for 2-DE 
analysis. In these cases, two alternative techniques were 
examined, (i) The tumor was cut in the middle and the 
fresh surface scraped (SO by a scalpel. The cell-rich 
material was then transferred to ice-cold culture 
medium (LIS with 5% fetal calf serum)/PIH. (ii) A part 
of the tumor sample was placed in culture medium on 
ice for further processing at the laboratory in the fol- 
lowing way: the material was cut into very small frag- 
ments on a pre-cooled dissection plate and transferred 
to a small glass chamber with a 0.7 mm metal net 5 mm 
above the bottom of the chamber. Medium /PIH was 
added to cover the sample (8 mL) which was gently 
squeezed (SQ) towards the net in order to release and 
wash out cells. NA and SC were also compared with an 
enzymatic extraction (EE) procedure described previ- 
ously [5): Briefly, thin slices of tissue were incubated 
with collagenase (1 mg/mL) and eiastase (2 mg/mL) in 
medium for 1 h at 37°C. Extracted cells from even* 
sample were then subjected to percoll gradient centrifu- 
gation (Section 3.2.3). 



2.43 Separation of cells by Percoll gradient 
centrifugation 

The cell suspension was filtered through two nylon mesh 
filters, (i) 250 and (ii) 100 urn and then centrifuged 



at 660 X g for 3 min. Thi cell pellet was resusnend.v 
carefully in medium, using j syringe and loaded onto 
two-step discontinuous Percoll/PBS gradient. 20 4 
(density s l.OS g/mLi and 54>t. idcnsiit = 1.0* c/mL 
and centrifuged at 1000 X g for 15 mm. In this system! 
dead cells stay on the top. viable cells sediment to iht 
interphase and erythrocytes sediment to the bottom. The 
viability of cells in the top fraction and interphase was 
checked by the trypan blue exclusion test. The inter- 
phase cell layer (> <J0*\ viability) was collected and 
washed one time in a large volume PBS/P1H (centri- 
fuged at 800 X g for 3 min). Finally, the cells were resus- 
pended in 1.4 mL PBS and pelleted at 2700 X g for 5 
min. The wet weight <W\V> was recorded and the pellet 
was then stored at -80 X. 

2.4.4 Final preparation of cells for 2-D PAGE analysis 

From this point, cultured cell samples were treated 
in the same way as tumor cell samples: Each cell pellet 
was thawed on ice and resuspended in 1.S9 uL mQ water 
per mg WW <=* j.gQ x.WWi U L. The suspension uas 
frozen and thawed 4-5 X to break the cells |7|. A 
volume of (0.089 X WW i U L 10% sodium dodecyl 
sulfate (SDSl. including 33.3".. mcrcaptoethanol. was 
mixed with the sample and incubated 5 min on ice with 
(0.329 X WW) U L of a solution of DNasc I (0.144 
mg/mL 20 mM Tri^llCI with 2 niM CACI. X 21 UO. pH 
8.8) and RNase A (0.0718 mg/mL Tris) |8.9). The sample 
was frozen and lyophili/ed. Sample buffer 1 10] including 




Figure I. Experimental flo* chart showiny main sicps of the prepara- 
tion procedures. The abbreviations used for norienzymatic extraction 
procedures are: FZ; frozen sample preparation; NA, needle aspira- 
tion; SC. scraped; and SO. squeezed sample. Extracted cells are then 
loaded as a suspension (top volume of each tube) onto either 
1.07 g/mL Percoll (lefti. or a discontinuous Percoll gradient from the 
nonenzymatic extraction i middle I. or from enzymatic extraction 
(right). Cellular top- and interphase fractions are then used for 2*DE. 
For details see Section 2. 
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2.4 £ Preparation of frozen tumor tissue 

The technique has been described previously (3.12). 
Briefly, the sample is moaned frozen to a fine powder, 
homogenized, lyophilized and solubilized in sample 
buffer. 

2.4.6 Control of representative) 

The tumors were examined routinely by experienced 
pathologists and smears or imprints from the samples 
were also assessed for cytometric DNA content by 
microspecirophotomeiry. 

2*5 2-D PAGE 

2-D PAGE was performed as described [8.10] except for 
the following details. The glass tubes for IEF. 12 X 200 
mm. contained 2.0^ Resolyte. pH 4-8 (BDH) and were 
cast to a height of 180 mm. A stock solution of acryl- 
amide (Serva) and A'.A"-methylenebisacrylamide (16.7:1 
for IEF and 37.5:1 for the second dimension) was deio- 
nized by mixing with 5°/o w/v Duolite MB 5313 mixed- 
resin ion exchanger (BDH) for 30 min. filtered (with a 
0.22 um nitrocellulose filter) and stored at -70°C. 
A'.A'-Methylenebisacrylamide. A*.A*.A\N'-tetramethyleth- 
ylenediamine (TEMED) and ammonium persulfate were 
purchased from Bio-Rad. IEF tubes were prefocused at 
200 V in 60 min. To each tube a sample corresponding to 
20-40 ug protein was applied and focused for 14.5 h at 
800 V and finally 1.0 h at 1000 V using a Protean II cell 
(Bio-Rad) and Model 1000/500 Power Supply (Bio-Rad). 
The lube gels were finally extruded into 1.25 mL equili- 
bration buffer, containing 60 mM Tris. pH 6.8 (2% SDS, 
100 mM dithiothreito! and 10% glycerol), frozen on dry 
ice and stored at -70°C The second dimension (1.0 X 
180 X 90 mm) of the acrylamide concentration was 10% 



T. and the gel contained 57b mM Tris. pH S.S. jnd v > 
SDS. IEF gels were applied on top of the slab m\, it^c 
with 0.5%. agarose containing electrophoresis runnm • 
buffer (60 mM Tris-base. 0.2 m glycine and 0.K SDS r . 
and electrophoresed with 10-n m A per gel (consuni 
current I ai +10T. Six gels w ere run together in a Pro- 
tean II xi 2-D Multi-Cell (Bio-Rad). Proteins were visual- 
ized by silver staining and photographed with the acidic 
side to the left [13.14]. 

2.6 Identification of polypeptides 

Vimentin and vimentin-derived polypeptides were identi- 
fied by extraction of an MDA-231 cell lysate with O.t* m 
KCl/0.5% NP-40 115]. Tropomvosins were exctracied 
from MDA-231 and WI38 cell lysates [16]. and cytokera- 
tins were extracted from MDA-231 and MCF-" cell 
lysates [17J. The patterns were compared with published 
maps [19-211. Proliferating cell nuclear antigen (PCNA) 
was identified by immunoblouinc (PC 10 mAB, Dako- 
patt) using a semidry system (Multiphor 11 Nova Blot. 
Pharmacia-LKB Biotechnology AB) and enhanced che- 
moluminescenfe (ECU detection ( Arncfshafni. 

3 Results 

3.1 2— DE of samples prepared from normal and 
tumorigenic cultured cells 

The object of this study was to develop methods for pre- 
paration of 2-DE maps from human tumor tissue which 
have the same high resolution as those obtained from 
cultured cells. Shown in Fig. 2 are high resolution 2-DE 
gels prepared from cultured cells and one leukemia: 
SV40 transformed embryonal rat fibroblasts WT2 (Fig. 
2a); human MDA-231 breast carcinoma cells (Fig. 2b); 
human WI38 fibroblasts (Fig. 2c ) and human pre B-ALL 
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cells (Fig. 2d). Polypeptides were identified through a 
laboratory exchange of cell samples/2-DE maps and 
through 2-DE analysis of purified proteins (Table 1). 

3.2 Preparation of samples from solid tumors 
3.2.1 Fresh versus frozen tissue 

An adenocarcinoma of the lung (LA) was prepared for 
2-DE by conventional methods using frozen material 
(Fig. 3a). There are several possibilities for the poor reso- 
lution using frozen tissue, including the presence of high 
molecular weight protein aggregates. Filtering extracts 
through 0.1 urn filters (Durapore. Millipore) resulted in 
a slightly improved resolution (not shown). When fresh 
tumor tissue from tumor LA was used for sample prepa- 
ration, using fine needle aspiration to collect the cells, 
the resolution was considerably improved (Fig. 3b). The" 
use of fresh tissue resulted in a general increase in reso- 
lution, which was most pronounced in the 50-100 kDa 
molecular mass range. A number of differences in the 
protein profiles of the gels in Figs. 3a and 3b can be ob- 
served, some of which are indicated in the figures. The 
decrease in serum albumin in Fig. 3b is likely to result 
from loss of serum proteins occurring when cells were 
pelleted after aspiration. Other differences, such as the 
decreased level of transformation-sensitive tropomyosins 
(TM1-TM3). may result from enrichment of tumor cells 
in the sample of Fig. 3b. Fine needle aspiration, a well- 
established technique in cytology, extracts mainly tumor 
cells because of decreased intercellular adhesiveness of 
neoplastic cells as compared to normal tissue. Micros- 
copic examination of DifT-Quick-stained extracted cells 
from case LA revealed almost 100% tumor cells, 
whereas the whole tissue extract contained approximate- 
ly 60 n n tumor cells. 
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A ACUftJ 

aA 6/pAa-Aciinin 

B23 Protein B23 /Numatnn 

EF2 Elongation factor 2 

EFI Elongation factor 1 0 

GT Gluuthione-S-transpherase ypt 

hsp60 Heat shock protein 60 

hsp73 Heat shock protein 73 

h$p80 Heat shock protein 80. GRPT8. BIP 

hsp90 Heat shock protein 90 

hsplOO Heat shock protein 100. Endoplasmin 

I Fa Intermediary filament associated 

k8 Cvtokerattn 8 

LamB Lamm B 

Lipl Lipoconin I 

Up2 Lipoconin II 

Lip5 Lipoconin V 

Mitl Mitcon 1/6 - Fl ATPase 

Mit2 Mitcon 2 

MiU Mitcon 3 

MRP Mucine Related Polypeptides 

pcna Ploliferating cell nuclear antigen 

PLC Phcspijciipase C <ii 

RO RO/SS-A antigen 

Sa Serum Albumin 

at , o/phtf-Tubultn 

bT berho-Tubuhn 

tml Non-muscle tropomyosin isoform 1 

tm2 Non-muscle tropomyosin i so term 2 

tm3 Non-muscle tropomyosin isoterm 3 

tm4 Non-muscle tropomyosin isoform 4 

tm5 Non-muscle tropomyosin isoform 5 

TP I Triose phosphate isomerase 

V Vtmentin 

Vidl Vtmentin derived protein 

Vid2 Vimentin derived protein 

Vid3 Vimentin derived protein 

Vid4 Vimentin derived protein 

Vin V'inculin 

a. homologous position with respect to other 

b. purified protein(s) 

c. immunoblotung 
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122 Comparison of difTerenl methods for preparing 
cells from fresh tumor tissue 

Samples were prepared from breast and lung carcinomas 
using either an enzymatic treatment with collagenase/ 
elastase or using nonenzymatic preparations (Fig. 4). a 
number of differences in the protein profiles were ob- 
served in the resulting 2-DE gels, some of which are 
indicated in Figs. 4a and b. These differences include 
both increases and decreases in spot intensity. These dif- 
ferences may result from degradation of high molecular 
weight polypeptides during enzymatic treatment, in- 
creased solubilization of polypeptides, or may have other 
causes. For many tumors, it was only possible to obtain 



small amounts of material since the> uerj reserxeJ to- 
other examinations. In these cases, samples could be pre- 
pared for 2-DE using either needle aspiration J r 
-scraping. Figure 5a shows a 2-DE gel prepared iron: 
squamous lung carcinoma (LS) cells collected n> needle 
aspiration and Fig. 5b shows a eel prepared from ihe 
same tumor by scraping. In this case, a number of differ- 
ences were recorded between the two procedures, some 
of which are arrowed in Fig. 5. Sjmples obtained from 
other tumors (breast and lunci generalh showed fewer 
differences between these two" methods of cell sampling 
(not shown). These data show that different nonenzy- 
matic extraction procedures may yield different polypep- 
tide patterns. However, the number of spots with a lame 
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difference in iniensity were lower than when a nonenzy- 
matic preparation was compared with an enzymatic pre- 
paration. 

2-DE maps of satisfactory quality were prepared by a 
third procedure. Cells were released from small pieces of 
tumor by squeezing (see Section 2). S me examples of 
this are shown in Fig. 6 where 2-DE maps derived from 
a case of hypernephroma. KH (Fig. 6a). a case of thyroid 
tumor. TA (Fig. 6b) and a case of corpus cancer. CP (Fig. 
6o can be seen. We conclude that nonenzymatic tech- 
niques are useful for 2-DE analysis of a number of dif- 
ferent tumors. The quality of the resulting gels is com- 
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parable to that obtained using cultured cells ccomp- 
the gels in Fig. 2 with those in Fig. 4. 6 and "i. \\ hicn o-* 
these methods will be optimal will, in our experience 
depend on the tumor material. For example, ven small 
tumors are preferably extracted by squeezing: on the 
other hand, breast cancers (which are often fibrous i 
yield satisfactory samples using scraping. 

3.2J Purification of cells on percoll gradients 

We considered the possible advantage of separating 
viable cells from dead cells, erythrocytes, and debris 
using discontinuous Percoll gradients.* Cells collected 
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from the inicrphasc showed a viability f more than 
90% as judged by trypan blue exclusion test. However it 
as found that the yield of viable cells decreased drama- 
tically if the tissue resection was not immediately pro- 
cessed. To study the efTeci of lysis of cells during the pre- 
paration procedure. 2-DE maps were prepared from 
nonenzymatically extracted cells of case LB collected 
from the top fraction < nonviable. Fig. 7a) and interphase 
fraction (viable. Fig. 7b). Tiese 2-DE maps were 
compared with corresponding fractions (nonviable Fig 
7c. and viable. Fig. 7d) of enzymatically extracted cells" 
One clear disadvantage of the enzymatic technique was 
that when loss of cell viability occurred during prepara- 
tion, a dramatic loss of high molecular weight polypep- 
tides was observed (Fig. 7c). This was probably due to 
degradation of intracellular proteins. However, nonenzy- 
matic preparations showed fewer differences between 
viable and nonviable cells: The most pronounced altera- 
tion was a decrease of a group of mucine related pro- 
teins (Fig. 7b). W e conclude, therefore, that disconti- 
nuous Percoll gradient is necessary after enzvmatic 
extraction of cells, but can be omitted from the nonenzy- 
matical tumor sample preparation procedure. 

We used the MDA-23I cell line to study the effects of 
cell lysis and leakage of cyiosolic polypeptides during 
sample preparation. Remarkably, after 30. 50. 80 and 140 
mm of incubation in PBS/PIH at 0"C. no significant 
chances were observed in the 2-DE pattern (not shown) 
Although loss of eel! viability may not result in protein 
degradation when ceils are incubated in the presence of 
protease inhibitors, loss of cytosolic proteins would be 
expected during pelleting of cells. We monitored the loss 
of lactate dehydrogenase (LDH) activity into the super- 
naiant during incubation in PBS of MDA-231 and MCF- 
7 breast cancer cells at 20"C. In both cases, loss of via- 
bility was paralleled by release of LDH from the cells 
(Fig. 8). Alter 5 h. ?0"n of :nc MCF-7 cells, but onlv 30% 
of the MDA-231 cells were dead (not shown). 
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Figure S Jht relat.vc release . fraction in supernatant ofiouh of lac- 
tate dehydrogenase acuivuy iLDH) and eel la viability versus incuba- 
tion time of the mammary carcinoma cell lines MDA-231 and MCF-7 
during incubation in PBS at 20"C. 



These data indicate the impact of a rapid preparer 
procedure, at 1 w temperature, of fresh tumor sample 
Experiments have also been performed using onn 
1.0/ g/mL Percoll (Fig. 6c and Fig. 1. left test tube) ,n 
order to remove erythrocytes. One clear advantaee with 
this procedure, which today is routinely utilized, is a 
higher yield of viable cells, probablv due to decreased 
sample preparation time. 



4 Discussion 

We describe procedures for sample preparation from 
solid tumors for 2-DE. 2-DE maps could be derived 
from solid tumors which were similar in quality to those 
obtained from cultured cells. Compared to' meth ds 
using frozen material, the resolvine power of the 2-DE 
technique is increased, allowing examination of a laree 
number of polypeptides from tumors of different malig- 
nancies. Other investigators (12.22) have used samples 
from frozen tumors to derive 2-DE maps. We have previ- 
ously described disadvantages encountered usinc frozen 
tumor sumpies including variations in contaminating pro- 
teins between different samples |3). The methods de- 
scribed here are based on the preparation of cells fr m 
tumors without enzymatic digestion. The enzvmatic step 
could be avoided since malignant cells usuallv grow as 
solid masses which are not strongly attached t the 
matrix. Furthermore, we found tha? "omitting the enzy- 
matic digestion alleviated the necessity of purifying 
viable tumor cells on Percoll gradients. This was in sharp 
contrast to cnzymatically treated samples, where loss of 
viability leads to loss of high molecular weight pr tcins 
(Fig. 7c). 

At least in the case of lung cancer, viable and nonviable 
cells showed small dilTcrenccs in respect to 2-DE maps. 
Presumably, protease inhibitors penetrate cells and 
inhibit proteolysis. In model experiments, we observed 
leakage ol cytosolic protein (LDH) from the cells in 
parallel to loss of viability. Apparently, however, nly a 
limited decrease of the level of low molecular weight 
cytosolic polypeptides was detected using silver staining 
combined with visual inspection. We have found that 
although some tumors arc well suited for the prepara- 
tion procedure described, others are not. In general, 
good results were obtained using tumors of the lung! 
breast, corpus and lymphomas. In contrast, cells from' 
thyroid adenomas and hypernephroma showed p or via- 
bility. We were in these cases unable to separate nonvi- 
able ceils from viable cells, and we can theref re not 
evaluate the consequence of the loss of viability on 
2-DE patterns, apart from a loss of some low molecular 
weight cytosolic polypeptides. 

Highly differentiated tumors may show lower viability as 
compared with poorly differentiated tumors (Dr. Farkas 
Vanky. personal communication). A number of samples 
from thyroid tumors were prepared for 2-DE but most 
cases showed poor viability. We believe that special care 
is needed during preparation of generally highly differen- 
tiated tumor gr ups. The difference between I ss of via- 
bility/leakage of LDH of the more differentiated MCF-7 
cells and the less differentiated MDA-231 cells is in line 
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uith these observations (Fig. 8). A number of potential 
and interesting markers, like tropomyosin isoforms. cno- 
keratms and heat shock proteins, appear to be insensi- 
tive to loss of viability during the preparation procedure, 
we have to date made numerous observations of altera- 
tions in the expression of these polypeptides in breast 
cancers and lung cancers. 

Another problem that may occur, irrespective of sample 
preparation techniques used, is admixture of lympho- 
cytes. These cases are easily detectable in smears and it 
may therefore be possible to select lymphoevte specific 
spots as "internal markers" for the 2-D PAGE analysis. 
Studies using this approach are in progress. Manv of the 
polypeptides identified are structural (table 1). Since the 
expression of many of these polypeptides are known to 
vary between normal and malignant cells, the possibility 
to determine their expression simultaneously is 
appealing. In the specific case of breast cancer, altera- 
tions in the expression of intermediate filament proteins 
(cytokeratinsi are known to occur during tumor progres- 
sion |23J. Other proteins known to be differentially 
expressed between normal cells and transform^ cells 
are tropomyosins. numairin/B23. heat shock proteins 
and PCNA. To this end. we have observed alterations in 
the expression of cytokeraiin 8. hsp 90. and non-muscle 
tropomyosin isolorm 2 during malienant proeression 
(Okuzawa er «/.. ,n preparation and Franzcn et aJ in Dre- 
paratiom. " K 

The method of choice for sample preparation from 
tumor tissues will depend on the properties of the tumor 
material studied. It may be important to use onlv one 
method when comparing cases within one uroup. as dif- 
ferences were observed between methods" The advan- 
tages oi the nonenzymaiic techniques arc (i ) that it mini- 
mizes contamination with connective tissue, (ii) ihat 
problems with contamination ol scrum proteins are 
avoided, and dm that separation of viable and dead cells 
not necessary. Hereby the rcvolvinu power of *>-D 
IACiE „ maximized for the analysis of human tumors 
*md studies on inter-tumor variations in ccne expression 
are lacil.iated. in addition, the polypeptide patterns ob- 
tained may be more representative for the in m -« tumor 
eel! Nincc the. use ol enzymes and incubations have been 
minimized. 

»r ^uUI like to n W nk Dr. J. /. Carrels. Dr. S Pattersson, 
A X !: r Hanash <""' Dr * J- £- Celts for mekitta sample 
«>Hl .-DE map exchanges possible. Tins study ivas sup- 
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ported by grams from the Swedish Cancer Soaen and the 
Cancer Society in Stockholm. 
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Reference points for comparisons of two-dimensional 
maps of proteins from different human cell types 
defined in a pH scale where isoelectric points correlate 
with polypeptide compositions 

A highly reproducible, commercial and nonlinear, wide-range immobilized pH 
gradient (IPG) was used to generate two-dimensional <2-D) ge l maps of 
[ "SJmethionine-labeled proteins from noncuitured. unfractionated normal 
human epidermal keratinocytes. Forty one proteins, common to most human 
cell types and recorded in the human keratinocyte 2-D eel protein database 
were identified in the 2-D gel maps and their isoelectric points {pi) were deter- 
mined using narrow-range IPGs. The latter established a pH scale that 
allowed comparisons between 2-D gel maps generated either with other IPGs 
in the first dimension or with different human protein samples. Of the 41 pro- 
teins identified, a subset of 18 was defined as suitable to evaluate the correla- 
tion between calculated and experimental p/ values for polypeptides with 
known composition. The variance calculated for the discrepancies between cak 
culated and experimental p/ values for these proteins was 0.001 pH units. 
Comparison of the values by the /-test for dependent samples (paired test) 
gave a p-ievel of 0.49, indicating that there is no significant difference between 
the calculated and experimental p/ Values. The precision of the calculated 
values depended on the buffer capacity of the proteins, and on average, it 
improved with increased buffer capacity. As shown here, the widely available 
information on protein sequences cannot, a priori, be assumed to be sufficient 
for calculating p/ values because post-translational modifications, in particular 
A'-terminal blockage, pose a major problem. Of the 36 proteins analvzed in 
this study, 18-20 were found to be A'-terminaliy blocked and of these onlv 6 
were indicated as such in databases. The probability of A'-terminal blockage 
depended on the nature of the A'-terminal group. Twenty six of the proteins 
had either M. S or A as A'-terminal amino acids and of these 17-19 were 
blocked. Only 1 in 10 proteins containing other .V-terminal uroups were 
blocked. 



1 Introduction 

As compared with carrier ampholyte isoelectric focusing 
(CA-IEF). the application of immobilized pH gradients 
(IPGs) in the first dimension in 2-D gel electrophoresis 
offers improved reproducibility [1] because the nature of 
the pH gradient makes the resulting focusing positions 
insensitive to the focusing time [2| and to the type of 
sample applied [3). The recently introduced ready-made 
IPG strips [4] seem to be an ideal substitute for the car- 
rier ampholyte gradients, which until now have been the 
most commonly used first dimensions in 2-D gel electro- 
phoresis. The availability of standardized first dimen- 
sions opens the possibility of comparing 2-D gel maps of 
various cell types generated in different laboratories, pro- 
vided that the focusing positions of a number of easily 
recognizable polypeptide spots common to the cell types 
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in question are known. Even though this approach is 
limited to experiments performed with the same standar- 
dized IPG. the flexibility provided by IPGs allows the 
pH gradient to be adjusted to the requirements of a par- 
ticular experiment. 

Exchange and communication of 2-D gel protein data re- 
quires a pH scale that is independent of the particular 
IPG used and by which the results can be described. The 
introduction of carbamylation trains and the relation of 
focusing positions to the spots in these trains repre- 
sented a step forward towards solving the reproducibility 
problem experienced with carrier ampholyte focusing (5J. 
Problems associated with the use of carbamylation trains 
were mainly due to lack of temperature control and to 
the use of nonequilibrium focusing conditions. Accord- 
ingly, the pattern variation involved not only the re- 
sulting pH gradients, but also the relative spot positions 
as related to each other and to spots in the carbamyla- 
tion trains. Even though the question of reproducibility 
has. to a large extent, been solved, the carbamylation 
trains are still not ideal as markers because the spots in 
the trains do not represent defined entities but rather a 
large number of differently carbamylated peptides 
having close p/ values. As a result, the spots are large 
and poorly defined as compared to the ordinary polypep- 
tide spots in 2-D gel maps. 
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Neidhardi etai. [6] defined the pH gradient in 2-D gel 
experiments by p/ markers whose p/ values were calcu- 
lated from the amino acid composition. Focusing posi- 
tions of other polypeptides could be predicted from their 
composition but the pA' values needed for the p/ calcula- 
tions were unknown. Various groups employing this 
approach do not use the same pK values [6. 7] and there- 
fore, the p/ values derived in this way cannot be 
expected to describe the variation of the hydrogen ion 
activity. In spite of this fact, it is still possible to make 
approximate predictions of focusing positions because 
the pK values used to define the pH gradient are also 
used to calculate pi values and to predict the focusing 
positions. Errors in pK assignments are therefore com- 
pensated. A pH scale which corretly reflects the variation 
in hydrogen ion activity during focusing should improve 
the precision of the predictions, but this has never been 
implemented with CA-IEF focusing as a first dimension 
in 2-D gel electrophoresis. The main reason for this are 
the problems associated with pH measurements in 
focused gels containing high concentrations of urea. 

IPGs can be described from the concentration variation 
of the immobilized groups, provided that the pK values 
of these groups are known for the conditions prevailing 
during focusing. To avoid measurements on gels, Gia- 
nazza etai. (8J suggested the use of pK values derived by 
addition of determined pK shifts. Recently, direct deter- 
minations of pK differences between immobilized 
groups in IPGs were made by determining pApA' values 
in overlapping narrow-range IPGs [9, 10] and the results 
verified the applicability of the Gianazza approach. A 
description of the focusing results in a pH scale, which 
correctly describes the variation of the hydrogen ion 
activity for the focusing conditions used, not only allows 
the comparison of 2-D gel maps generated with different 
IPGs, but also opens the possibility for correlating the 
focusing position of a polypeptide with its composition 
(9J. Experiments by Bjellqvist etai [9, 10] have implied 
thai pH scales showing good correlation between calcu- 
lated and experimental pi values can be derived for any 
of the conditions commonly used for focusing in connec- 
tion with 2-D gel electrophoresis. These pH scales are 
then defined through the pK values of the immobilized 
groups in the IPG containing gel. To be useful for inter- 
laboratory comparisons, however, the pH scale has to be 
defined through pi values of easily recognizable spots 
present in the 2-D gel map. So far, pi determinations in 
a useful pH scale, combined with determinations of pK 
values needed for pi calculations, have only been made 
for the pH range 4.5-6.5 at 10 °C [9]. CA-IEF focusing as 
described by OTarrell [11] does not control the tempera- 
ture of the first dimension, which can be expected to be 
slightly above room temperature. With IPGs, the temper- 
ature commonly used is about 20°C [4, 12] or 25°C [13] 
and this is a critical parameter that needs to be con- 
trolled [14]. 

The present work was designed to compare 2-D gel maps 
of different cell types in a laboratory applying both 
CA-IEF and IPG focusing at a common temperature. To 
this end we have generated 2-D gel maps of proteins 
from noncultured, unfractionated normal human epi- 
dermal keratinocytes with IPG in the first dimension 



and a focusing temperature of 25 C We have used com- 
mercial nonlinear, wide-range IPG strips which give 2-D 
gel maps that are closely similar to the ones resulting 
with the CA-IEF technique used to establish the human 
keratinocyte database [15]. As an initial step towards 
interlaboratorv comparisons of results obtained with the 
nonlinear gradient as a first dimension we report here 
on the focusing positions of 41 known proteins that are 
common to most human cell types. The pH range 
covered corresponds to the range in classical CA-IEF 
2-D gel electrophoresis and in order to use these pro- 
teins as internal standards for comparing 2-D gel maps 
generated with other IPGs we determined their pi values 
with narrow-range IPGs in the first dimension. We have 
compared the calculated versus experimental pi values 
and show that it is necessary to have further information 
(absence or presence and nature of posttransiational 
modifications), in addition to amino acid composition to 
be able to calculate pi values that correspond to the 
actual experimental values. The pA' values used for the 
calculations are provided and the usefulness of pi predic- 
tion in relation to database information is discussed. 
Furthermore, we comment on the possibility of using 
experimentally determined p/ values to verify the avail- 
able database infbrmation on polypeptide composition. 



2 Materials and methods 

2.1 Apparatus and chemicals 

Equipment for isoelectric focusing and horizontal SDS 
electrophoresis (Multiphor v II electrophoresis chamber, 
Immobiline v strip tray. Multidrive XL programmable 
power supply. Macrodrive power supply and MuItitemp K 
II) was from Pharmacia LKB Biotechnology AB 
(Uppsala. Sweden). Vertical second-dimensional gels 
were run in the home-made equipment described in [15]. 
The IPG strips with the wide-range nonlinear pH gra- 
dient were either lmmobiline DryStrip v pH 3-10 NL, 
180 mm or alternatively 160 mm long IPG strips with a 
corresponding pH gradient. In both cases the IPG strips 
were delivered by Pharmacia LKB. lmmobiline, Pharma- 
lyte. Ampholine. GelBond as well as PAG film and the 
ready-made horizontal SDS gels (ExcelGeP XL SDS 
12—14) were also from Pharmacia LKB. Purified proteins 
and peptides were from Sigma (St. Louis. MO). 

2.2 Sample preparation 

Preparation and labeling of unfractionated keratinocytes 
as well as fibroblasts have been described in [16]. Cells 
were lysed in a solution containing 9.8 m urea, 2% w/v 
NP-40* 100 mM DTT and 2% v/v Ampholine pH 7-9. 

2.3 2-D gel electrophoresis 

First-dimensional focusing was performed according to 
Gorg etai [2] with some minor modifications, as de- 
scribed in [9]. Rehydration of the IPG strips was made 
in a solution containing 9.8 m urea, 2% w/v CHAPS, 10 
mM DTT and 2% v/v carrier ampholyte mixture. The car- 
rier ampholyte mixture consisted of 2 parts Pharmalyte 
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4-6.5. 1 pan Amphoiine pH 6-8 and 1 part Pharmalyte 
pH 8-10.5. Usually, cathodic sample application was 
used and the samples were diluted 2-20 times in a solu- 
tion containing 9.8 m urea. 4°o w/v CHAPS. l°o w/v 
DTT and 35 mx Tris base. For acidic application, the 
Tris-base was substituted with 100 m.M acetic acid. The 
degree of dilution and sample volume (20-100 uL) 
depended on the particular sample and the IPG. and 
whether visualization of the proteins was to be done by 
Coomassie Brilliant Blue or silver staining. With the 
wide-range non-linear IPG. 10-30 of total protein 
was loaded for silver staining and 100-200 Mg for Coo- 
massie staining. Focusing was done overnieht with Vh 
products in the range of 45-60 kVh with 160 mm long 
strips and 50-70 kVh with 180 mm long strips. Solubili- 
zation of polypeptides and blocking of-SH groups prior 
to the second-dimensional run. as well as loading on the 
second-dimensional gel was done as described in (9). 
The stacking gel was omitted and 5-10 mm were left at 
the top of the second-dimensional gel for applying the 
IPG strip. The space was filled with electrode buffer con- 
taining 0.5 °'b w/v agarose. Casting, running, staining and 
autoradiography were carried out as described in" [15]. 

2.4 Experimental determination of p/ values 

The determination of the pA' differences between Immo- 
bilines pA' 4.6. pA* 6.2 and pA* 7.0 necessary for the cali- 
bration of the pH scale at 25 C in 9.8 m urea was done 
as described in [9] with the same narrow-range IPGs. 
The pH scale was defined by setting the pA' value of 
Immobilinc pA' 4.6 equal to 4.61 [9] and the determined 
pA differences gave the pA* values of Immobilines pA'6.2 
and pA' 7.0. equal to 5.73 and 6.54. respectively. The pA' 
differences found arc in good agreement with values de- 
rived from [17] and [8] by extrapolation to 9.8 m urea 
concentration. As in [9J. additional narrow-range recipes 
have been used for determining p/ values. With narrow- 
range IPGs extending to pH values higher than the pA' 
value ol Immobiline pA' 7.0. anodic sample application 
was used with acetic acid added to the sample solution. 
Otherwise, cathodic sample application was used with 
the same sample bufTer as for wide-range IPGs. 

2.5 Protein compositions used for p/ calculations 

Wuh the exception of vimentin. protein compositions 
arc Irom the Swiss-Prot database [18]. For vimentin. we 
used the data from [19], where the amino acid at posi- 
tion 41 is a D instead of a S. Information in the Swiss- 
Proi database on phosphorylation has been disregarded 
because it was known from earlier studies (J. E. Celis 
unpublished results) that the spots in question corre- 
sponded to the unphosphorylated forms of the peptides. 



different substituents on the c-carbon were taken into 
account. The calculations of p/ values were made with 
the aid of the IPG-maker program [20].- 

2.7 pA' values used for p/ calculations 

For the carboxyl terminal group and internal glutamyl 
and asparty! residues the same pA' values were used as in 
19]. For C-terminal glutamyl and aspartyl residues, sep- 
arate pA: values were derived with the aid of the Taft 
equations [9. 21]. The pA' values of histidyl croups were 
calculated from the pi values of human carbonic anhv- 
drase I as in [9]. For A-terminal glycine a pA* value of 
7.50 was used. The pA' shift caused by a substituent on 
the a-carbon was assumed to be identical with the pA* 
shift the substituent caused for the amino group in the 
amino acid, i.e. 2.28 pH units were subtracted from the 
pA' values for the amino groups in the amino acids given 
in [22. 23]. The approximate pA* value of 9 for the cys- 
tenyl group was taken from [24]. For tyrosyl and arginvl 
groups we used the pA' values for the amino acids" [22. 
23]. For lysyl groups the effect of high urea concentra- 
tion on amino groups. was taken into account and 0.5 pM 
units were subtracted from the amino acid pA' value. 
These last three pA' values are far from the pH range 
under study and the results found would have been the 
same if lysyl and arginvl groups were assumed to be 
fully ionized while the ionization of tyrosyl groups were 
neglected. A complete list of the pA' values used is civen 
in Table 1. 



Table I. pA* Values used for the ionizable groups in peptides 
9.8 m urea. 25°C 



Ionizable 




group 


C-ierminal 


3.55 


V-ierminal 




Ala 


"5° 


Met 


7.00 


Ser 




Pro 


8.36 


Thr 


6.82 


Val 


7.44 


Glu 


7.70 


Internal 




Asp 


4.05 


Glu 


4.45 


His 


5.98 


Cys 




Tyr 


10 


Lvs 


10 


Arg 


12 


C-ierminal side chain groups 




Asp 


4.55 


Glu 


4.75 



2.6 Calculation of pi values 

For the p/ calculations it was assumed that the same pA* 
value could be used for an amino acid residue in all 
polypeptides and in all positions in the peptide except 
lor A- or C-terminally placed amino acids. For the pA' 
values ol the A'-terminal amino groups the efTect of the 



2.8 Statistical analysis 

Statistical comparisons of the experimental and calcu- 
lated p/ values were done on an Apple Macintosh Ilsi 
using the statistical package Statistica/Mac. release 3.0b 
(from StatSoft Inc., Tulsa. Oklahoma). Calculated and 
experimental p/ values were compared by the Mest for 
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correlated samples (paired Mest). The normality of p/ 
differences was estimated graphically by probabil'ity 
plots. The variances of the data presented here and the 
similar data on plasma and liver proteins in [9] were 
compared by the F-test. 

3 Results and discussion 

3.1 Identification of polypeptides and pi determinations 

The 2-D gel maps of [ Jf S]methionine-labeled proteins 
from noncultured. unfractionated normal human kerati- 



ief- 



CO 
Q 
CO 

I 



nocytes. focused with the nonlinear, wide-ranee IPG 
CA-IEF pH gradients in the first dimension ^re >hou» 
in Figs. 1 and 2. respectively. The IPG extends to higher 
pH values but otherwise the two patterns are \tr\ "sim- 
ilar and most of the spots in the IPG pattern can ?e 
directly related to the corresponding spots in the 
CA-IEF gel. To obtain comparable patterns ii was impor- 
tant to keep the focusing temperature as similar a> 
possible. Compared to other studies [1-4. 9. 10. 12-14). 
we increased the urea concentration in the focusing gel 
to 9.8 m because keratins streaked badly in the focusing 
dimension when 8 m urea was used, presumably due to 
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Figure I. :-D eel protein map of | J5 S|methionine-labeled proteins from noncultured. unfractionated normal human kerattnocvies focused with 
the nonlinear, wide-range IPG in the first dimension. The position of the 41 proteins analyzed in this study is indicated. 
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aggregates of acidic and basic keratins An increase in 
urea concentration to 9 m or more eliminated* these 
streaks; apart from this effect, no other maior chanees in 
the focusing positions were observed. In Fie. 1 we" have 
indicated the positions of 41 known proteins from the 
human keratinocyte 2-D ge! database that are most 
likely common to most human cell types. The choice 
was made because these proteins are'easv to identify 
with certainty. With the exception of stratifin (spot 2) 
involucrin (spot 4) and keratin 14 (spot 15). which are all' 



epithelial markers, these proteins are also preseni in 
human fibroblasts (Fig. 3) and lymphoevtes (results not 
shown) and therefore can be used as landmarks for com- 
paring 2-D gel maps derived from different ceil tvpes. In 
Table 2 the 41 proteins are listed together with their 
sample spot numbers (SSP) in the human keratinoevte 
protein database and p/ values determined in 2-D isel 
maps generated with narrow-ranee IPGs in the first 
dimension. 
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3.2 Comparison between the determined and calculated 
p/ values for human keratinocyte proteins 

Thirty six of the 41 proteins listed in Table 2 are found 
in the Swiss-Prot database. Contrary to the plasma and 
liver proteins used in [9). the pi calcuations on the pro- 
teins used in this study posed some problems that 
reflected the way in which they were characterized. The 



proteins used by Bjellqvist et al. [9] were eithe- v-> 
abundant and well-characterized plasma proteins or thev 
were identified by .V-terminal sequencing and. therefore, 
the nature of the A-terminals (acetvlated or non-acen - 
lated) was in both cases known. The proteins used in 
this study have all been characterized bv internal 
sequencing [7] and it is known that .V-terminal acetvla- 
uon occurs with high frequency in eukarvotes. 
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According to Brown and Robert [25]. proteins with acety- 
lated A-terminals correspond in weight to approximately 
80% of the soluble protein in ascites cells. Based on 
results from A'-terminal sequencing, at least 40°/o of the 
spots in the human liver protein 2-D gel map appear to 
be blocked [3J. The corresponding number, derived from 
107 spots in the 2-D gel map of human T-lymphocyte 
proteins, falls between 60 and 65% (J. Strahler, personal 
communication). Information concerning A-terminal 
blockage is not normally available, and in the Swiss-Prot 
database only 6 of the 36 keratinocyte proteins are speci- 
fied as A'-terminally blocked. We have, within the present 
material, defined 18 proteins for which the A-terminals 
are very likely to be correctly described. Six of these pro- 
teins are listed in the Swiss-Prot database as A ; -termi- 
nally blocked, four represent proteins which appear in 
the human liver 2-D gel map and have been A'-termi- 
nally sequenced as liver proteins [3] and the remaining 
eight have A'-terminal groups other than M. S and A. i.e. 
A-terminals for which A-acetylation is uncommon [26]. 
In Figs. 4A. B. C and D p/ values calculated from Swiss 
Prot database information are plotted against the experi- 



mentally determined p/ values for all the keratinoj\:c 
proteins listed in Table 2 and for the 18 selected pro- 
teins, as well as for the plasma and liver proteins \ &±\± 
from [9] valid for 10 °C)* 

The calculations show that without knowledge of the 
status of the A'-terminal group, precise predictions of p/ 
values for eukaryotic proteins cannot be achieved based 
on the information available in Swiss-Prot and similar 
databases. However, for proteins where the A'-terminal 
status is known, we find good correlation between pre- 
dicted and experimental p/ values. When the variance of 
the p/ discrepancies and the variance of calculated 
charges at the experimental p/ values derived from the 
present data set are compared with the corresponding 



There arc lour plots: (AI the 3b polypeptides from norm.il human 
keraiinocytes (no corrections). (B) the 36 polypeptides from Fit:. 4 A 
where p/ values have been recalculated for 12 polypeptides with M. 
S and A as .V-ierminally assumed blocked, based on calculated 
charge. (C) the 18 selected polypeptides with information on the 
A-terminal configuration, and (D) plasma and liver proteins. 



"ft . 

B 



I tfWtKnpMjl M 



Figure 4. Calculated vs. experimental p/ values. Lines are fitted using the least squares* criterion. (A) 36 polypeptides from normal human kcraii : 
nocytes (no corrections). <B» 36 polypeptides from Fig. 4A (including the !8 marker polypeptides) where p/ values have been recalculated 
assuming A'-terminal blockage: x indicates recalculated p/ values: nucleolar protein B23 is indicated with an arrow. (C) 18 polypeptides with infor- 
mation on A'-terminal configuration and (D) plasma and liver proteins. 
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values derived from the daia on plasma and liver pro- 
teins in (9) (Table 3). the present data are found to result 
in larger variances for the values of both p/ discrepancies 
and calculated charge at the experimental p/ value when 
no information on posttranslational modification is 
taken into consideration. Correction for possible A-aceiv- 
laiion of 12 polypeptides with M. S and A as .Y-terminal 
results in a smaller variance of p/ discrepancies, al- 
though not significantly different from values derived 
from [9], whereas the variance of the calculated charge at 
the experimental p/ value is significantly higher. For the 
18 selected proteins the variance for the p/ discrepancies 
is significantly smaller than for the data in [9]; however, 
the corresponding value for calculated charge at the 
experimental p/ value does not improve to the same 
extent. This, we believe, reflects another difference 
between the two sets of proteins used for the calcula- 
tions. Based on spot distributions in 2-D gel maps, the 
set of proteins used here has a molecular weight distri- 
bution that is more representative of the patterns ob- 
served in mammalian cells. In the study by Bjellqvist 
eiaL [9] most of the high molecular weight plasma pro- 
teins had to be excluded due to their unknown content 
of sialic acid which made the proteins analyzed in this 
study heavily biased towards low molecular weight pro- 
teins. The buffer capacity of proteins normally increases 
with the protein's molecular weight, and the average 
buffer capacity of the presently selected proteins with 
assumed known A'-terminals is 18 charge units/pH unit, 
while the corresponding value for the proteins used in 
[9] is only 9 charge units/pH unit. High buffer capacity 
can be expected to improve the agreement between cal- 
culated and experimental p/ values. Inspection of the 
data presented in Table 2 for the polypeptides with 
assumed known A'-terminals verifies the importance of 
the bulTer capacity. For 8 polypeptides having buffer 
capacities higher than 15 charge units/pH unit, the calcu- 
lations in all cases yielded p/ discrepancies with absolute 
values of less than 0.02 pH units. The largest discre- 
pancy. 0.06 pH units, was observed for annexin II and 
stathmin. proteins which have low buffer capacity: 0.9 
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and 6.6 charge units/pH unit, respectively. The proba- 
bility that the focusing position of a protein with known 
composition will fall within a certain distance from the 
calculated p/ value therefore cannot be predicted by the 
variance alone. The buffer capacity of the specific protein 
must be taken into consideration as well. As indicated 
by the decrease of the variance of calculated charges at 
the experimental pi value for the selected proteins, the 
observed improvement can not solely be due to the 
higher buffer capacity of the keratinocyte proteins. The 
two studies relate to different experimental conditions. 
Good agreement between experimental and calculated 
pi values implies that the proteins are defolded and a 
factor that may contribute to the observed improvement 
is a more complete defolding of proteins caused by the 
higher temperature and urea concentration used in this 
study. 

The data indicated that the precision with which p/ 
values can be predicted for polypeptides with high buffer 
capacity is better than the precision with which experi- 
mental pi values can be determined. If the pH is defined 
through the pK values of the immobilized groups in the 
IPG containing gel. the 1 " precision of the experimentally 
calculated data will depend on the pH difference 
between the pi and the pA' value of the immobilized 
group with the closest pA'. For the present study this will 
give pi determinations with a precision varying in the 
range of ± 0.02-0.05 pH units [9]. The good agreement 
observed between the calculated and experimental pi 
values is due to the fact that errors are mainly system- 
atic and, as discussed in [9], they will largely be cancelled 
out in the calculations. A pH scale defined through the 
presently determined pi values will not necessarily 
reflect the variation of the hydrogen ion activity during 
the focusing step in an optimal way. but it still allows 
precise predictions of focusing positions for polypeptides 
with known compositions, including information on 
posttranslational modifications. Calculated net charge at 
the experimentally found isoelectric point defined in this 
scale will serve as a tool to verify that the polypeptide 



Tabic 3. Mean values and variances for ihc dilVcrcncc (experimental p/-calculaied 
values, respect rvel> 



pf) in pH units and calculated charges at the experimental p/ 



-Plasma and liver 

proteins 
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Keratinocyte proteins 
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composition used in the calculation is correct and com- 
plete. Exceptions to this are proteins such as involucrin 
and heat shock protein 90 that have very high buffer 
capacities. Introduction of an extra charge unit into 
these proteins will only result in p/ shifts falling in the 
range of 0.01-0.02 pH units and the effect is that the 
quality of the pH definition - the precision by which pA' 
values used in the calculations are given and the preci- 
sion of experimental p/ values in these cases - will limit 
the possibilities to verify polypeptide compostion based 
on the experimental p/ value. 

Statistical comparison of experimental and calculated p/ 
values was done using the Mest for dependent samples 
and normality of the discrepancies was estimated by 
probability plots. For the 36 proteins, the p-level is 
0.0021. indicating that a result like this is unlikely to 
be a chance effect and must be assumed to represent a 
real difference. After correction for the most like l> 
A'-terminal configuration, the p-level is 0.043 and cannot 
be accepted as representing the same population since 
the p-level is less than 0.05 - the traditional p-limit of 
statistical significance. For the 18 proteins with a known 
or very likely A-terminal configuration the /-test gave a 
p-level of 0.49, which verifies that the experimental and 
calculated p/ values are not significantly different. 

Besides showing that p/ values for denatured proteins 
with known compositions can be calculated with a high 
degree of precision from average pA' values, the results 
also provide strong support for the notion that 
A-terminal blockage heavily depends on the nature of 
the A'-terminal groups [26]. The results seem to indicate 
thai with A'-terminals other than M. S and A. only a few 
proteins have blocked A'-terminals (1 out of 10 proteins 
in the present study), while it can be inferred from the 
data presented in Table 2 that a majority of the proteins 
with M. S and A as A'-terminal are blocked. After correc- 
tion for the effect of suspected A'-terminal blockage 
there is only one protein (nucleolar protein B23) out of 
the 36 used in ihis study, which, in spite of a high buffer 
capacity, has a marked difference of 0.11 pH units 
between predicted and determined p/ values (Fig. 4B); 
this corresponds to 3 charge units due to the high buffer 
capacity of this protein. This discrepancy in p/ prediction 
and calculation of net charge at the p/ is probably not 
due to deficiencies in the database information but 
instead reflects a shortcoming of the model used for p/ 
calculations. Nucleolar protein B23 contains a domain 
extremely rich in aspartic and glutamic acid residues 
(Table 4). in which 26 out of 28 amino acid residues 
from position 161 to 188 are either a D or an E. A calcu- 
lation based on the use of average pA' values unin- 
fluenced by the charged neighboring amino acid resi- 
dues cannot be expected to correctly describe the p/ 
value with almost half of the acidic groups packed 



Table 4. Amino acid sequence of nucleolar phosphoprotein B23 



together into a highly negatively charged recior.. Th:> 
limitation caused by calculations based on average ?A 
values does not severely limit the usefulness of 
approach since a search through Swiss-Prot shows thr. 
this type of D/E-rich motif is uncommon, and the exis- 
tence of a highly charged region is immediately apparerv. 
upon inspection of the amino acid sequence. 

The quality of the information available in databases, 
especially concerning posttranslational modifications, is 
a major problem when the data is to be used for p/ pre- 
dictions. The p-level of 0.043 found for all 36 proteins 
after correction for .V-acetylation. shows that this prob- 
lem is not only limited to A'-terminal blockade and the 
very good agreement found for the eighteen poi> pep- 
tides, with assumingly correctly described A'-terminal 
(Fig. 4C), must be regarded as an exception from this 
point of view. A'-Termina! blockage is generally the main 
problem in relation to p/ predictions for eukaryotic pro- 
teins. Of the 36 keratinocyte proteins analyzed". 18-20 
are suspected to be A-terminally blocked (6 proteins blo- 
cked according to Swiss-Prot, 12 proteins with M. S or A 
as A'-iern\inai and assumingly blocked based on the cal- 
culated charge, and two proteins, involucrin and 
nucleolar protein B23. with M as A-terminal for which 
the data does not allow any conclusion). This is in rea- 
sonable agreement with the conclusions based on the 
A'-terminal sequencing data derived in connection with 
2-D gel electrophoresis. A'-terminal blockage can be sus- 
pected for 17-19 of the 26 proteins with M, S or A as 
A'-terminal. while only 1 in 10 proteins with other 
A-terminal groups are blocked. The information that the 
frequency of A'-terminal blockage is strongly related to 
the nature of the A-terminal group will be of some help 
in connection with p/ predictions based on database 
information. However, without information from other 
sources, an uncertainty will always remain as to whether 
the A-terminal charge should be included in the p/ calcu- 
lation. 



4 Concluding remarks 

The data presented here lays the foundation for com- 
paring 2-D gel protein maps of different cell types gener- 
ated with nonlinear, wide-range IPGs in the first dimen- 
sion. The focusing positions of 41 polypeptides common 
to most human cell types have been described in a pH 
scale that allows focusing positions to be predicted with 
a high degree of accuracy, provided that the composition 
of the polypeptides are known and that information on 
posttranslational modifications are available. For poly- 
peptides with a very high buffer capacity, the limiting 
factor is the precision with which experimental pH 
values can be determined rather than the precision of 
the calculations. Possible deficiencies in the pH scale 
description of the variation of the hydrogen ion activity 
has. at least at the present state, no consequences for its 
practical use. The major limitation in connection with 
predictions of focusing positions from polypeptide com- 
positions is the quality of existing data on protein com- 
positions, especially concerning posttranslational modifi- 
cations. Amino acid sequences have been reasonably 
easy to obtain, while posttranslational modifications 
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have been difficult and work-intensive to determine. 
Recent developments in the field of mass spectrometry 
are fast changing this situation and within the next years 
we can expect a surge in reliable data in this area. While 
awaiting this development, verification of correctness 
and completeness of available information on polypep- 
tide composition can be provided by experimental p/ 
values in a pH scale based on the pi values determined 
in this study. So far. our data cover the pH range below 
pH = 7.5. the basic pH range covered by NEPHGE as 
first dimension will be covered in forthcoming work. 

Received December 29. 1993 
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A two-dimensional gel database of rat liver proteins 
useful in gene regulation and drug effects studies 

A standard two-dimensional (2-D) protein map of Fischer 344 rat liver 
(F344MST3) is presented, with a tabular listing of more than 1200 protein species. 
Sodium dodecyl sulfate (SDS) molecular mass and isoelectric point have been es- 
tablished, based on positions of numerous internal standards. This map has been 
used to connect and compare hundreds of 2-D gels of rat liver samples from a va- 
riety of studies, and forms the nucleus of an expanding daub as e describing rat 
liver proteins and their regulation by various drugs and toxic agents. An example 
of such a study, involving regulation of cholesterol synthesis by cholesterol-lower- 
ing .drugs and a high-cholesterol diet, is presented. Since the map has been ob- 
tained with a widely used and highly reproducible 2-D gel system (the Iso-Dalt* 
system), it can be directly related to an expanding body of work in other laborato- 
ries. 
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1 Introduction 

High-resolution two-dimensional electrophoresis of pro- 
teins, introduced in 1975 by OTarreil and others II— 41. has 
been used dver the ensuing 16 years to examine a wide va- 
riety of biological systems, the results appearing in more 
than 5000 published papers. With the advent of computer- 
ized systems for analyzing two-dimensional (2-D) gel ima- 
ges and constructing spot databases, it is also possible to 
plan and assemble integrated bodies of information de- 
scribing the appearance and regulation of thousands of pro- 
tein gene products [5, 6]. Creating such databases involves 
amassing and organizing quantitative data from thousands 
of 2-D gels, and requires a substantial commitment in tech- 
nology and resources. 

Given the long-term effort required to develop a protein da- 
tabase, the choice of a biological system takes on consider- 
able importance. While in \ttro systems are ideal for answer- 
ing many experimental questions, especially in cancer re- 
search and genetics, our experience with cell cultures and 
tissue samples suggests that some m vivo approaches could 
have major advantages. In particular, we have noticed that 
liver tissue samples from rats and mice appear to show grea- 
ter quantitative reproducibility (in terms of individual pro- 
tein expression) than replicate cell cultures. This is perhaps 
a natural result of the homeostasis maintained in a com- 
plete animal vs. the well-known variability of cell cultures, 
the latter due principally to differences in reagents (e.g.. 
fetal bovine serum ). conditions (e.y-. pH ) and genetic •evo- 
lution" of cell lines while in culture.lt is also more difficult 
to generate adequate amounts of protein from cell culture 
systems (particularly with attached cells), forcing the inves-. 
tigator to resort to radioisotope-based or silver-based stain- 
detection methods. While these methods are more sensi- 
tive (sometimes much more sensitive) than the Coomassie 
Brilliant Blue (CBB) stain typically used for protein detec- 
tion in "large" protein samples, they are generally more vari- 
able, more labor-intensive and. in the case of radiographic 
methods, may generate highly "noisy" images, due to the 
properties of the films used. By contrast, large protein sam- 
ples can easily be prepared from liver using urea/Nonidet 
P-40 (NP-40) solubilization and stained with CBB, which 
has the advantage of being easily reproducible (8). Finally, 
there remains the question of the "truthfulness" of many in 
vitro systems as c m pared to their in vivo analogs; how 
great are the changes caused by the introduction into a cul- 
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lure and the asso ci ated shift to strong selection Tor growth, 
and how do these affea experimental outcomes? Hence 
the apparent advantages f in vitro systems, in terms of ex- 
perimental manipulation, may be counterbalanced by 
other factors relating to 2-D data quality. 

There is a second important class of reasons for exploring 
the use of an in vivo biological system such as the liver. His- 
torically, there have been i»o broad approaches to the me- 
chanistic dissection of biochemical processes in intact cel- 
lular systems: genetics (a search for informative mutants) 
and the use of chemical agents (drugs and chemical toxins). 
Both approaches help us to understand complex systems 
by disrupting some specific functional element and show- 
ing us the result. With the development of techniques for 
genetic manipulation and cloning, the genetic approach 
can be effectively applied either in vitro or in vivo, although 
the in vitro route is usually quicker. The chemical approach 
can also be applied to either son of biological system; here, 
however, the bulk of consistently acquired information is 
in experimental animals (rats and mice). While most biolo- 
gists know a short list of compounds having specific, experi- 
mentally useful effects (e.g.. inhibitors of protein synthesis, 
icncphcres, polymerase inhibitors, channel blockers, nu- 
cleotide analogs, and compounds affecting polymerization 
of cytoskeletal proteins), there is a much larger number of 
interesting chemically-induced efTects. most of them char- 
acterized by toxicologists and pharmacologists in rodent 
systems. Just as a thorough genetic analysis would involve 
saturating a genome with mutations, it is possible to ima- 
gine a saturating number of drugs, the analysis of whose ac- 
tions would reveal the complete biochemistry of the cell. 
While organized drug discovery efTons usually target spe- 
cific desired effects, the nature of the process, with its de- 
pendence on screening large numbers of compounds, ne- 
cessarily produces many unanticipated effects. It is there- 
fore reasonable to suppose thai the required broad range of 
compounds necessary to achieve "biochemical saturation* 
may be forthcoming; in fact, it may already exist among the 
hundreds of thousands of compounds that failed to qualify 
as drugs. 

Among organs, the liver is an obvious choice for the study 
of chemical effects because of its well-known plasticity and 
responsiveness. The brain appears to be quite plastic (e.g. 
(7J), but it is a complicated mixture of cell types requiring 
skillful dissection for most experiments. The kidney, while 
quite responsive, also presents a potentially confounding 
mixture of cell types. The liver, by contrast, is made up of 
one predominant cell type which is easy to solubilize: the 
hepatocyte, representing more than 95% of its mass. Most 
importantly, the liver performs many homeostatic func- 
tions that require rapid modulation of gene expression. It 
appears that most chemical agents tested affect gene ex- 
pression in the liver at some dosage (N. Leigh Anderson, 
unpublished observations), an interesting contrast to our 
earlier work with lymphocytes, for example, which seem to 
be much less responsive. Such results conform to the expec- 
tation that cells with a homeostatic, physiological role 
should be more plastic than cells differentiated for a pur- 
pose dependent on the action of a limited number of spe- 
cific genes. 

The liver also allows the parallels between in vitro and in 
vivo systems to be examined in detail. Significant progress 



has been made in the development of mouse, rat 




ystems include 

mouse and rat in vtvo on one level and mouse, rat and 
man in vitro on a second level, and to compare efTeci 
tween species and between systems. This approach allV^ 
us to draw informed conclusions regarding the biochen- * % 
"universality' of biological responses among the manir 
and to offer some insight into the validity of m vu/^ 
proaches for toxicological screening. We believe thisd*"* 
will be necessary if in vitro alternatives are to achieve u 
usage in government-mandated safety testing of drugs 
sumer products and industrial and agricultural chemic^ 

A number of interesting studies have been published us 
2-D mapping to examine effects in the rodent liver A nu- 
ber of investiearors have made use of the technics 
screen for existing genetic variants [8-1 1] or induced mo- 
tions [12-14]. mainly in the mouse. This work builds on tn* 
wealth of genetic information available on the mouse an;' 
its established position as a mammalian mutauon-deitc. 
lion system. While some studies of chemical effects haw 
been undenakerrin the mouse (15-17). most have'used the 
rat [18-23). The examination of the cytochrome p-450s>> 
tern, in particular, has been carried out almost exclusive!* 
on the rat [24. 25). 

These considerations lead us to conclude that rodent live: 
offers the best opportunity to systematically examine ar. 
array of gene regulation systems, and ultimately to build a 
predictive model of large-scale mammalian gene control. 
The basic underlying foundation of such a project is a reli- 
able, reproducible master 2-D pattern of liver, to which on- 
going experimental results can be referred. In this paper, we 
report such a master pattern for the acidic and neutral pro- 
teins of rat liver (pattern F344MST3). In future, this master 
will be supplemented by maps of basic proteins, and analog- 
ous maps of mouse and human liver. 



2 Materials and methods 
2.1 Sample preparation 

Li ver is an ideal sample material for most biochemical stud- 
ies, including 2-D analysis. A sample is taken of approxima- 
tely 0.5 g of tissue from the apical end of the left lobe of the 
liver Solubilization is efFected as rapidly as practical; * 
delay of 5-15 min appears to cause no major alteration in 
liver protein composition if the liver pieces are kept cold 
(e.g.. on ice) in the interim. In the solubilization process, 
the liver sample is weighed, placed in a glass homogenizer 
(e.g., 15 mL Wheaton); 8 volumes of solubilizing solution* 

• The solubilizing solution is composed of 2% NP-40 (Sigma).' M ***** 
(analytical grade. ex. BDH or Bio-Rad), 0.5% diihiothreitol (OT^ 
Sigma) and 2% carrier ampholytes (pH 9-11 LK B : these come iS3 *\j 
stock solution, so 2 % final concentration is achieved by miking the n** 
solution 10% 9-U Ampholine by volume). A large batch ° f W^'^ 
(several hundred mL) is made and stored frozen at -80°C in ^ ,q **~T 
sufficient to provide enough for one day's estimated sample 9 rt *~~l 
lion requirement. The solution is never allowed to become 
than room temperature it any stage during preparation or itltWta ^^ t 
use, since heating of concentrated urei solutions can produce eon 
nints thit covilently modify proteins producing irtifactuil cfc* 1 ^ 
shifts. Once thawed, any unused solubilirer is discarded. 
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I (i.e., A mL per Oi g tissue) and the mixture is ho- 
I using first the loose- and then then the tight-fit* 
{glass pestle. This takes approximately 5 strokes with 
& pestle and is carried out at room temperature because 
H would crystallize out in the cold. Once the liver sample 
thoroughly homogenized in the solubilaer. it is assumed 
it all the proteins are denatured (by the chaotropic effect 
the urea and NP-40 detergent) and the enzymes inacti- 
#d by the high pH (-9.5). Therefore these samples may 
: kept at room temperature until they can be centrifuged 
frozen as a group (within several hours of preparation). 
ie samples are centrifuged for6 X 10* * min (e.g.. 500 000 
f for 12 min using a Beckman TL-100 centrifuge). The 
:nirtfuge rotor is maintained at just below room tempera- 
je (e.j.. 15-20 °C), but not too cold, so as to prevent the 
xdpitation of urea. The centrifuge of choice is a Beckman 
H00 because of the sample tube sizes available, but any 
itracentrifuge accepting smallish tubes will suffice. When 
i appropriate centrifuge is not available near the site of 
imple preparation, samples can be frozen at -80 'C and 
lawed prior to centrifugaiion and collection of superna- 
its.Each supernatant is carefully removed following cen- 
ifugation and aliquoted into at least 4 clean tubes forstor- 
5*. This is done by transferring a 1 .! the supernatant to one 
lean tube, mixing this gently (to assure homogeneous 
composition) and then dividing it into 4 aliquots. The ali- 
uots are frozen immediately at -80°C. These multiple ali- 
uots can provide insurance against a failed run o: a freezer 
•rtakdown. 



L2 Two-dimensional electrophoresis 

iample proteins are resolved by 2-D electrophoresis using 
he 20 X 25 cm Iso-Dalt* 2-D gel system ((26-29); pro- 
ceed by LSB and by Hoefer Scientific Instruments, San 
Francisco) operating with 20 gels per batch. All first-dimen- 
iional isoelectric focusing (IEF) gels are prepared using the 
same single standardized batch of carrier ampholytes 
gDH 4-8A in the present case, selected by LSB's batch- 
nesting program for rat and mouse database work**). A 10 
sSTsample of solubilized liver protein is applied to each gel, 
and the gels are run for 33 000 to 34500 volt-hours using a 
progressively increasing voltage protocol implemented by 
^programmable high-voltage power supply. An Ange- 
Ikjue* computer-controlled gradient-casting system (pro- 
duced by LSB) is used to prepare second-dimensional sod- 
turn dodecyl sulfate (SDS) polyacrylamide gradient slab 
gels in which the top 5 % of the gel is 1 1 %T acrylamide, and 
deflower 95 % of the gel varies linearly from 1 1 % to 1 8 %T. 

gus system has recently been modified so as to employ a 
gmmercially available 30.8%T acrylamide/A'.A p -raethyle- 
tejbisacrylamide prepared solution (thus avoiding the han- 
AGog of the solid acrylamide monomer) and three addi- 
tional stock solutions: buffer (made from Sigma pre-set 
T^s), persulfate and A^.A^.A'-tetramethyleihylenedi- 
.Jawe (TEMED). Each gel is identified by a computer- 
Bgnted filter paper label polymerized into the lower left cor- 
jj t °f the gel. First-dimensional IEF tube gels are loaded 

bis material (succeeding certified batches of which are available from 
ioefer Scientific Instruments) has the most linear pH gradient pro- 
duced by any ampholyte tested except for the Pharmacia wide range 
{(which has an unacceptable tendency to bind high-molecular weight 
adic proteins, causing them to streak). 



directly (as extruded) onto the slab gels without equilibra- 
tion, and held in place by polyester fabric wedges (Wed- 
gies'*, produced by LSB) to avoid the use f hot agarose. 
Second-dimensional slab gels are run overnight, in groups 
of 20, in cooled DALT tanks (10°C) with buffer circulation. 
All run. parameters, reagent source and lot information, 
and notations of deviation from expected results are ente- 
red by the technician responsible on a detailed, multi-page 
record of the experiment. 

13 Staining 

Following SDS-electrophoresis, slab gels are stained for 
protein using a colloidal Coomassie Blue G-250 procedure 
in covered plastic boxes, with 10 gels (totalling approxima- 
tely 1 L of gel) per box. This procedure (based on the work 
of Neuhoff [30,31]) involves fixation in 1.5 L of 50% etha- 
nol and 2% phosphoric acid for 2 h. three 30 min washes, 
each in 2 L of cold tap water, and transfer to 1.5 L of 34% 
methanol, 17% ammonium sulfate and 2% phosphoric acid 
for 1 h. followed by the addition of a gram of powdered Coo- 
massie Blue G-250 stain. Staining requires approximately 4 
days to reach equilibrium intensity, whereupon gels are 
transferred to cool tap water and their surfaces rinsed to re- 
move any paniculate stain prior to scanning. Gels may be 
kept for several months in water with added sodium azide. 
The water washes remove ethanol that would dissolve the 
stain (and render the system noncolloidal. with high back- 
grounds). The concentrated ammonium sulfate and meth- 
anol solution is dfluted by equilibration with the water vol- 
ume of the gels to automatically achieve the correct final 
concentrations for colloidal staining. Practical advantages 
of this staining approach can be summarized as follows: (i) 
the low, flat background makes computer evaluation of 
small spots (max OD < 0.02) possible, especially when 
using laser densitometry'; (») up to 1500 spots can be reli- 
ably detected on many gels (e.g., rat liver) at loadings low 
enough to preserve excellent resolution; and (iii) reprodu- 
cibility appears to be very good: at least several hundred 
spots have coefficients of reproducibility less than 15%. 
This value is at least as good as previous CBB methods, and 
significantly better than many silver stain systems. 

2.4 Positional standardization 

The carbamylated rabbit muscle creatine phosphokinase 
(CPK) standards [32] are purchased from Pharmacia and 
BDH. Amino acid compositions, and numbers of residues 
present in proteins used for internal standardization, are 
taken from the Protein Identification Resource (PIR) se- 
quence database [33]. 

2-.S Computer analysis 

Stained slab gels are digitized in red light at 134 micron re- 
solution, using either a Molecular Dynamics laser scanner 
(with pixel sampling) or an Eikonix 78/99 CCD scanner. 
Raw digitized gel images are archived on high-density DAT 
tape (or equivalent storage media) and a greyscale video- 
print prepared from the raw digital image as hard-copy 
backup of the gel image. Gels are processed using the Kep- 
ler* software system (produced by LSB), a commercially 
available workstation-based software package built on 
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some of the principles of the earlier TYCHO system (34— 
41]. Procedure PROC008 is used to yield a spoUist giving 
position, shape and density information for cadi detected 
sp t. This procedure makes use of digital filtering, mathe- 
matical morphology techniques and digital masking to re- 
move the background, and uses full 2-D least -squares opti- 
mization to refine the parameters of a 2-D Gaussian shape 
for each spot. Processing parameters and file locations are 
stored in a relational database, while various log files detail- 
ing operation of the automatic analysis software are ar- 
chived with the reduced data.The computed resolution and 
level of Gaussian convergence of each gel are inspected 
and archived for quality control purposes. 

Experiment packages are constructed using the Kepler ex- 
periment definition database to assemble groups of 2-D 
patterns corresponding to the experimental groups (e.g.. 
treated and control animals). Each 2-D pattern is matched 
to the appropriate ^master" 2-D pattern (pattern 
F344MST3 in the case of Fischer 344 rat liver), thereby 
providing linkage to the existing rodent protein 2-D data- 
bases. The software allows experiments containing hun- 
dr ds of gels to be constructed and analyzed as a unit, with 
up to 100 gels displayed cn the screen at one time for com- 
parative purposes and multiple pages to accommodate ex- 
periments of > 1000 gels. For each treatment, proteins 
sh wing significant quantitative differences vs. appropriate 
controls are selected using group-wise statistical parame- 
ters (e.g.. Student s t-test, Kepler* procedure STUDENT). 
Proteins satisfying various quantitative criteria (such as P< 
0.001 difference from appropriate controls) are repre- 
sented as highlighted spots onscreen or on computer-plot- 
ted protein maps and stored as spot populations (i.e., logi- 
cal vectors) in a liver protein database. Quantitative data 
(spot parameters, statistical or other computed values) are 
stored as real-valued vectors in the database. Analysis of co- 
regulation is performed using a Pierson product-moment 
correlation (Kepler procedure CORREL) to determine 
whether groups of proteins are coordinately regulated by 
any of the treatments. Such groups can be presented graphi- 
cally on a protein map,and reported together with the statis- 
tical criteria used to assess the level of coregulation. Multi- 
variate statistical analysis (e.g., principal components* ana- 
lysis) is performed on data exported to SAS (SAS Institute). 

2.6 Graphical data output 

Graphical results are prepared in GKS and translated 
within Kepler* into output for any of a variety of devices. 
Linedrawing output is typically prepared as Postscript and 
printed on an Apple LaserWriter. Detailed maps presented 
here have been generated using an ultra-high-resolution 
Postscript-compatible Linotronic output device. Greyscale 
graphics are reproduced from the workstation screen using 
a Seikosha videoprinter. Patterns are shown in the standard 
orientation, with high molecular mass at the top and acidic 
pr teins to the left. 

2.7 Experiment LSBC04 

In the study described here 12-week-old Charles River 
male F344 rats were used. Diets were prepared at LSB, 
based on a Purina 5755 M Basal Purified Diet. Lovastatin 
and cholestyramine were obtained as prescription pharma- 



ceuticals.ground and mixed with the diet at conr. 
of 0.075% and 1%, respectively. The hicfa 2 entraii ons 
was Purina 580IM-A (5% cholester 1 pits ftS?* 1 
late in the control diet). Animal work was carried o^T 
etiological Associates (Bethesda, MD). Animals wer*^' 
chmatized for one week on the control diet fed t«i? * c ' 
trol diets for one week, and sacrificed on dav 8 a n * 
daily doses of lovastatin and cholestyramine in anml * 
groups were 37 rag/kg/day and 5 g/kg/day isS''"* 
based on the weight of the food consumed LivtrUl^' 
were collected and prepared for 2-D electrophoresis ,? Pcs 
ing to the standard liver protocol (homogeni za inJ ° TC ' 
volumes of 9 m urea, 2% NP^O. 0.5% dithiS n2» V 
LKB pH 9-11 carrier ampholytes, followed I E*c2Sf 2 * 
tion for 30 min at 80000 X g). Kidney, brained 
samples were frozen. Gels were run as described It 
and the data was analyzed using the Kepler* svstem r 7" 
were scaled, to remove the effect of differences inp£ 
loading, by setting the summed abundances of a laree n f 1 
ber of matched spots equal for each gel (linear selling, 

3 Results and discussion 
3.1 The rat liver protein 2-D map 

F344MST3 is a standard 2-D pattern of rat liver proteins 
based on the Fischer 344 strain. This pattern was initiated 
from a single 2-D gel and extensively edited in an experi- 
ment comparing it to a range of protein loads, so as to in- 
clude both small spots and well-resolved representations of 
high-abundance spots. More than 700 rat liver 2-D patterns 
have been matched to F344MST3 in a series of drug effects 
and protein characterization experiments, and numerous 
new spots (induced by specific drugs, for instance) have 
been added as a result. A modified version including addi- 
tional spots present in the Sprague-Dawley outbred rat has 
also been developed (data not shown). Figure 1 shows a 
greyscale representation and Fig. 2 a schematic plot of the 
master pattern. More than 1200 spots are included, most of 
which are visible on typical gels loaded with 10 uLof solubi- 
lized liver protein prepared by the standard method and 
stained with colloidal Coomassie Blue. Master spot num- 
bers (MSN's) have been assigned to all proteins, and ap- 
pear in the following figures, each showing one quadrant of 
the pattern. Figure 3 shows the upper left (acidic, high 
molecular mass) quadrant, Fig. 4 the upper right (basic, 
high molecular mass) quadrant, Fig. 5 the lower left (acidic, 
low molecular mass) quadrant, and Fig. 6 the lower right 
(basic, low molecular mass) quadrant. The quadrants over- 
lap as an aid to moving between them. The gel position (in 
100 micron units), isoelectric point (relative to the CPK in- 
ternal pi standards) and S DS molecular mass (from the cali- 
bration curve in Fig. 8) are listed for each spot (Table 1). Be- 
cause of the precision of the CPK-p/ values, these parame- 
ters can be used to relate spot locations between gel sys- 
tems more reliably than using p/ measurements expressed 
as pH. A major objective of current studies is the identifica- 
tion of all major spots corresponding to known liver pro- 
teins, as well as rigorous definitions of subcellular orga- 
nelle contents. Of particular interest to us is the parallel de- 
velopment of identifications in the rat and mouse live' 
maps, allowing detailed comparisons of gene expression ef- 
fects in the two systems. The results f these studies will be 
presented systematically in a later edition of this database 
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fire include here a useful series of 22 orienting identifi- 
jjonsas an aid to otherusers of the rat liverpaitern (Table 

m 



^Carbimylaied chaise standard*, computed p/s and 
' molecular mass standardization 

febave previously shown thai the use of a system of close- 
spaced internal p/ markers (made by carbarn ylating a 
jsic protein) offers an accurate and workable solution to 
it problem of assigning positions in the pi dimension [32]. 
be same system, based on 36 protein species made by car- 
ainylating rabbit muscle CPK. has been used here to as- 
ign pfs to most rat liver acidic and neutral proteins. The 
aadards were coelectropboresed with total liver proteins, 
n d the standard spots added to a special version of the 
laster pattern F344MST3. The ge! A-coordinates of all 
: ver protein spots lying within the CPK charge train were 
hen transformed into CPK p/ positions by interpolation 
tttween the positions of immediately adjacent standards 
Table 1) using a Kepler* vector procedure. 

thas proven possible to compute fairly accurate pi values 
or many proteins from the amino acid composition [42]. 
JVe have attempted here to test a further elaboration of this 
lpproach. in which we computed pf s forthe CPKstandards 
Jiemselves, based on our knowledge of the rabbit muscle 
CPK sequence and the fact thai adjacent members of the 
iarge train typically differ by blockage of one additional ly- 
ane residue (Table 3). We compared these values to similar 
computed pf s for an additional set of carbamylated stand- 
ards made from human hemoglobin beta chains and a se- 
ries of rat liver and human plasma proteins of known posi- 
tion and sequence (Fig. 7,Table 4).The result demonstrates 
good concordance between these systems. Two proteins 
show significant deviations: liver fatty-acid binding protein 
(FABP; #1 in Table 4) and protein disulphide isomerase 
(#20 in the table). The FABP spot present on F344MST3 
may represent a charge-modified version of a more basic 
parent spot closer to the expected p/, not resolved in the 
JEF/SDS gel. Of particular importance is the fact that, by 
comparing c mputed p/"s of sequenced but unlocated pro- 
fins with the CPK pA, we can assign a probable gel loca- 
$Q0 without making any assumptions regarding the actual 
gel pH gradient. This otters a useful shortcut, given the va- 
garies f pH measurement on small diameter IEF gels. We 
gye used this approach to compute the CPK pfs of all rat 
fid mouse proteins in the PIRsequence database, as an aid 
protein identification (data not shown). 

J.orderto standardize SDS molecular weight (SDS-MW), 
j have used a standard curve fitted to a series of identified 
Jgiteins (Fig. 8). Rather than using molecular mass per se, 
^e have elected to use the number of amino acids in the 
polypeptide chain, as perhaps a better indication of the 
" ngth of the SDS-coated rod that is sieved by the second 
Qension slab. The resulting values were multiplied by 
(the weighted average mass of amino acids in se- 
jjenced proteins) to give predicted molecular masses. Be- 
Use we use gradient slabs, we have n t constrained the fit- 
^curve to conform to any predetermined model; rather 
joined many equations and selected the best using the 
Jgram "Tablecurve" on a PC. The equation chosen was> 
y- 6x+ c/x 1 , where .vis the number of residues, x is the gel 



^coordinate, a is 5 1 1 .83, b is -0.273 1 and c is 33 1 8380 1 . The 
resulting fit appears to be fairly good over a broad range of 
molecular mass. 

3 J An example of rat liver gene regulation: Cholesterol 
metabolism 

Experiment LSBC04 was designed as a small-scale test of 
the regulation of cholesterol metabolism in vivo by three 
agents included in the diet: lovastatin (Mevacor*,an inhibi- 
tor of HMG-CoA reductase); cholestyramine (a bile acid 
sequestrant that has the effect of removing cholesterol 
from the gut-liver recirculation); and cholesterol itself. The 
frrst two agents should lower available cholesterol and the 
third should raise it, allowing manipulation of relevant 
gene expression control systems in both directions. Such 
an experiment ofFers an interesting test of the 2-D mapping 
system since most of the pathway enzymes are present in 
low abundance, many are membrane-bound and difficult 
to solubilize, and the pathway itself is complex. Approxima- 
tely 1000 proteins were separated and detected in liver ho- 
mogenates. Twenty-one proteins were found to be affected 
by at least one treatment, and Hbese could be divided into 
several coregulated groups. 

3.3.1 MSN 413 (putative cytosolic HMG-CoA synthase) 
and sets of spots regulated coordinate!) or inversely 

One group of spots (including a spot assigned to the cyto- 
solic HMG-CoA synthase, MSN 413) showed the expected 
increase in abundance with lovastatin or cholestyramine, 
the synergistic further increase with lovastatin and choles- 
tyramine, and a dramatic decrease with the high cholesterol 
diet. Spot number 413 is the most strongly regulated pro- 
tein in the present experiment, showing a 5- to 10-fold in- 
duction after a 1 week treatment with 0.075 % lovastatin and 
1 9m cholestyramine in the diet (Figs. 9 and 10). Its expres- 
sion follows precisely the expectation for an enzyme whose 
abundance is controlled by the cholesterol level; it is pro- 
gressively increased from the control levels by cholestyra- 
mine, lovastatin and lovastatin plus cholestyramine, and it 
sinks below the threshold of detection in animals fed the 
high cholesterol diet. This spot has been tentatively identi- 
fied as the cytosolic HMG-CoA synthase, based on a reac- 
tion with an antiserum to that protein provided by Dr. Mi- 
chael Greenspan at Merck Sharp fcDohme Research Labo- 
ratories. This enzyme lies immediately before HMG-CoA 
reductase in the liver cholesterol biosynthesis pathway, and 
is known to be co-regulated with it. Spot 413 has an SDS 
molecular weight of about 54 000 and a CPK p! of -1 1.4, in 
reasonably close agreement with a molecular weight of 
57300 and a CPK pi of -15.7 computed from the known se- 
quence of the hamster enzyme [43]. 

Using a classical product-moment correlation test (Kepler 
procedure CORREL), a series of five additional spots was 
found to be coregulated with 413. The level of con-elation 
was exceedingly high O 95%). Two of these, 1250 and 933, 
are at similar molecular weights and approximately one 
charge more acidic than 413 (Fig. 9), indicating that they 
may be covalently modified forms of the 413 polypeptide. 
This suspicion is strengthened by the bservation that both 
spots are also stained by the antibody to cytosolic HMG- 
CoA synthase. The remaining three correlated spots appear 
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to comprise an additional related pair (1253 and 1001) of 
around 40 kDa and a single spot (11 19) of ar und 28 kDa 
Because these two presumed proteins are present at sub^ 

$ T2^]?T abundances lhao 413 - aod b«ause the cvto- 
solicHMG-CoAs>-nthase is reported toconsist fonlvone 
type of polypeptide, they are likely to represent ther.' verv 
tightly coregulated engines. A second group of six spots 
was selected based on a regulator) pattern close to the in- 
verse of that for spot 413 (MSN"s34,79. 178. 182,204 347- 
data not shown). For these proteins, the lowest level of ex- 
pression occurs with exposure to lovasiaiin plus cholestyra- 
mine and the highest level upon exposure to the high-cho- 
lesierol diet. Spots 182 and 79 are highly correlated and lie 
about one charge apart at the same molecular weight- they 
may thus be isoforms of a single protein. The other" four 
spots probably represent additional enzymes or subunits. 

3 J 2 MSN 235 and coregulated spots 

A third group of five spots, mainly comprised of mitochon- 
drial proteins including putative mitochondrial HMG- 
Co A synthase spots, showed a modest induction by lovasta- 
lin alone, but little or no effect with any of the other treat- 
ments (including the combination of lovastatin and choles- 
tyramine; Fig. 12). This result is intriguing because lovasta- 
tin was expected to affect only the regulation of enzvmes of 
cholesterol synthesis, which is entirely extra-mitochon- 
dnal. Three of the spots (235. 134, 144) form a closely- 
packed triad at approximately 30 kDa. and are likely to re- 
present isoforms of one protein. All three spots are stained 
by an antibody to the mitochondrial form of HMG-CoA 
synthase obtained from Dr. Greenspan. Subcellular fractio- 
nation indicates a mitochondrial location. The other two 
spots (633 at about 38 kDa and 724 at about 69 kDa) are 
each present at lower abundance than the members of the 
triad. 



fim,M «w« 199,, 



to. 



3 33 An example of an anti-synergistic effect 

A sixth spot (367) shows strong induction by lovastatin 
(two- to threefold), and about half as much induction with 
lovastatin plus cholestyramine, but without sharing the ani- 
mal-animal heterogeneity pattern of the 235-set (Fig 13) 
This protein is also mitochondrial, and represents the clear- 
est xample of an anti-synergistic efTect of lovastatin and 
cholestyramine. The existence of such an efTect demon- 
strates that lovastatin and cholestyramine do not act exclu- 
sively through the same regulatory pathway. 

3.3.4 C mplexity of the cholesterol synthesis pathway 

Taken together, these results suggest that treatment with lo- 
vasutm alone can affect both cytosolic and mitochondrial 
pathways using HMG-CoA, while cholestyramine, on the 
other hand,eitheraJone or in combination with lovastatin 
produces a strong effect on the putative cytosolic pathway! 
but little or no effect on the putative mitochondrial path- 
way. An explanation for this difference may lie in lovasta- 
tin s effect on levels of HMG-CoA and related precursor 
compounds that are exchanged between the cytosol and 
the mitochondrion, whereas cholestyramine should affect 
only the cytosolic pathways directly controlled by cholester- 
ol and bile acid levels. It remains to be explained why some 



proteins of the putative mitochondrial path*,, 
much more variable in their expression in all ,J2 are *> 
animation of all the coregulated groups summiYIE An e * 
titative statistical techniques can extract a wealth «r- Uan - 
estmg information from large sets of reproducible d. "2?- 
abundance of spots in the 413 coregulation eroun ! 
pie. shows an amazing level of concordance in " 
expression among the five individuals of the | 0 v«i», • * 
cholestyramine treatment group. This effeci is not <i B,ne 
differences in total protein loading. since thev have »?, Ue -1° 
been removed byscaling.and since proteins with ft 
ferem regulation patterns can be demonstrated u i v 
13 ). Such effects raise the possibility that manv g ene L 1 
lation sets may be revealed through the study of a 
cienily large population of control animals (/. J it l 
any experimental manipulation). This approach exniof, 0 " 1 
natural biological variation in protein expression SsW 
drug effects, offers an important incentive for [he coSf 
non of a large library of control animal patterns 



4 Conclusions 

V 

Because of the widespread use of rat liver in both basic bio- 
chemistry and m toxicology, there is a long-term need fora 
comprehensive database of liver proteins. The rat liver mas 
ter pattern presented here has proven to be an accurate re 
presentation of this system, having been matched to more 
than 700 gels to date. As the number of proteins identified 
and the number of compounds tested for gene expression 
effects grows, we expect this database to contribute valu- 
able insights into gene regulation. Its practical utility in sev- 
eral areas of mechanistic toxicology is already being de- 
monstrated. 
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5 References 

fl] O'Farrcll, P../ Btot. CAem. 1975.250, 4007-4021. 
[2] KJose. J.. Humangenetik 1975, 26. 231-243. 
[3] Scheelc, G. A.,/ BioL Chem. 1975. 250. 5375-5385. 
|4J lborra, G. and Buhler, J. M.. Anal. Biochem. 1976. 74. 503-511. 
[5] Anderson, N. G. and Anderson. N. L.. Behnng. Inst. Mm. 1979. 61 
169-210. 

[6] Anderson, N. G and Anderson. N. L.. Chn. Chem. 1982.25, 739-74S 

(7] Heydorn. W. E.. Creed, G. J. and Jacobowiu, D. M., J. Pharmacol. 
Exp. Therap. 1984. 229, 622-628. 

[8] Anderson. N.L-.Nance.S.UToUakscn.S.L.,Giere,F.A.and Ander- 
son, N. G., Electrophoresis 1985. 6, 592-599. 

f9] Racine. R. R. and Langley, C. H.. Biochem. Genet. 1980. IB, 185-197. 
110) Klose. J.. Mol. Evol. 1982, 18. 315-328. 

(1 1] Neel.J. V.. Baier, L.. Hanash.S.and Erickson, R. P., / Hered. 1985. W. 
314-320. 

[12J Marshall, R. R..Raj.A.S.,Grant,FJ.and Heddlc.J. A.. Con J.Ge*r< 
CytoL 1983. 25. 457-446. 

fl3J Taylor.J..Andcrson.N.L.,Anderson.N.G..Gemmell.A..Gioment. 
C. S.. Nance, S.L. and Tollaksen. S. L.. in: Dunn. M J. (Ed.), 
phoresis '86, Verlag Chemie. Weinheim 1986, pp. 583-587. 

|U] Giometti, C. S.. Gemmell. M. A.. Nance. S. L., Tollaksen. S. I 
Taylor, J.. J. Btol. Chem. 1987, 262. 12764-12767. 

[ 15] Anderson. N. U Giere, F. A., Nance. S. L., Gemmell. M. A.. TolUk- 
sen. S. L. and Anderson, N. G.. in: Gaiieau, M.-M. and Siesi. C 
(Eds.). Progres Recent* en Electrophone Bidimensionetle. Pre*** 
Universiiaircs de Nancy. Nancy 1986. pp. 253-260. 

[16] Anderson. N. L., Swanson, M.. Giere. F. A., Tollaksen. S.. GemfnelL 
A., Nance, S.L. and Anderson, N. electrophoresis 1986. 7. 44-** 



Dftubtie of ni li*cr proums 



913 



Afidenoa. N. L. Giert.F. A- Since S. L_ Getnmeil. M. A^TolUi- 
«n, S. L tad Anderson* KG- F**Jam. Appl Toxtcol. 1 987. 1 39-50. 
Anderson,N.L-.in: hewHorcotu in7ax2eoio&. Eli Lilly Symposium. 
1991. in press. 

Afltotne. Ik Rahimi-Pour. A. SiesL Mag dalou. J. and GaJteau. 
M. Cell. Btochem. Funa. 19r. /. 21-231. 
ElUotL B. M.. Ramasamy. R_ Sionard. M. D. and Sprit*, S. P. Bio- 
dim. Btophys. Acta 1986. £*t. 135-140. 

Huber. B. E.. Heilman. C. A.. Wlrtf.. P. J.. Milter. M.J. and Thor* eirv 

son.S. S^ Hepatoi&r 198e. i. 20«-:i9. 

Wirth. P. J. and Vesierberj. O- Ete^ropnoresis 1988. V. 4 T -53. 

Wiumann. F. A. and Parker. D. S.. 7oxta>:. Let:. 1991. 5\ 29-36. 

Rampersaud. A.. Waxmac. D. J - Ryan. D. E.. Levin. V and Wal2. F. 

G^U.*Arth. Biochem. Btophys. 1985. -Vi. 174-U3. 

VUsuk. G.P.axid Walt F.G„)x~Anal. fiiortrm. 1980. 1 12-120. 

Anderson, N. G. and Anderson. N. I_ .4«a.'. Btochem. 1978. 5.\ 331— 

340. 

Anderson. N. L. and Anderson. N. C J. Btochem. 1978. £5. 341— 
3S4. 

Anderson. L.. Hofmann. J. -P.. Andersoc. E.. Walker. B. and Ander- 
son.N.C.in: Endler. A.T.ani Hir.iih.S . fEds.i. U ^Dimensional 
Electrophoresis. VCH Verlagsg esellschaiu Wetnheim 1969. pp. 288- 
297. 

Anderson. L.. Two~Dimenstonat Electrophoresis: Operation of the 
ISO-DALJ* System. Large Scale Biology Press. Washington. DC 
1988, ISBN 0-945532-00-8. 170pp. 

Neuho!T.V..Summ.R.and &bl.H~Eieetrophoresis 1985.6.427-448. 



13 1] NeuhofT. V.. Arold. N.. Taube. O. and Ehrturdi. W„ Electrophoresis 
1988, 9, 255-262. 

[32) Anderson, N.L.and Hickman. B.J.. Anal. Biochem. 1979, 9i. 3 1 2-320. 

133) Sidman. K. E.. George. D. E., Barter, W. C. and Hunu L. T., AW/. 
Acids Res. 1988. 76. 186^-1871. 

[34] Taylor. J.. Anderson, N . L.. Coulter. B. P.. Scandora. A. E. and Ander- 
son. N.G.. in: Rjdola.B.J.(Ed.).±>rr'opA<wm 7$.deGruyter. Ber- 
lin 1980. pp. 329-339. 

[35} Taylor. J.. Anderson. N. L. and Anderson. N. G .. in : Allen. R. C. and 
Arnaud. P. (Eds.). Electrophoresis '81. de Gruyier. Berlin 1981. pp. 
383-400. 

(36) Anderson. N. L-.Taylor. J.. Scancora. A. E.. Coulter. B. P. and Ander- 
son. N. G.. Clin. Chem. 1981. .\\ I807-1S20. 

(37) Tay lo r. J.. Anderson. N.L.. Scandora. A. E.. J r..W r illard.K.E. and An- 
derson. K. G., Clin. Chem. 1982. 28. 861-866. 

(38) Taylor. J.. Anderson. N.L.and Anderson. S.CElearophorest s 1983, 
.< 338-345. 

[39] Anderson. N. L. and Taylor, J., in: Proceedings of the Fourth Annual 
Conference and Exposition of the Sat tonal Computer Graphics Associa- 
tion. Chicago. June 26-30. 1983. pp. 69-76. 

140] Anderson. N. L.. Hofmann. J.-P.. Gemmell. A. and Taylor. J.. Clin. 
Chem. 1984, J0, 203 1-2036. 

[41) Anderson. L., in: Schafer-Nielsen. C. (Ed.). Electrophoresis "M, VCH 
Verlagsgesellschaft, Weinheim 1988. pp. 313-321. 

|42] Neidhardt. F. C, Appleby. D. A.. Sankar, P.. Hutton. M. E. and Phil- 
lips, T. A.. Electrophoresis 1989. 10. 116-121. 

[431 Gil. G„ Goldstein. J. L., Sliuflhier C, A, and Brown. M. S.. J. Bio!. 
Chem. 1986, 261. 3710-3716. 



Daubuc of nt hvcT pretcms 915 



oo o 




o 



o 

o 



c o 



'e 2. Schematic representation of the master pattern <the same as Fig. 1 ), useful as an aid in relating specific areas of Fig: 1 and the following detailed 
i rants. 



916 



1 



SEW" 3 * -~ 



e«5 




£5" 



oio*r 




C* 



0 s * 3 C*^ 



o w w -» y» I 



/-J J I 

-toil 



QrfM 



ejau 
0>* 



3- O" 



o 3 * 



o'°" A 



//jure 3. Upper left (bigb molecular weight, acidic) quadrant (#1) of ibe rat liver map, showing spot numbers. 
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Lower right (low molecular weight, basic) quadrant (#4) of the rat liver map, showing spot numbers. 
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4 Figure 7. (a) Plot of computed isoelectric point versus gel J.posiuon fo- 
two sets of carbamylated sundard proteins (rabbit muscle CPK M ail- 
human hemoglobin 6 chain. Tilled diamonds) and several other proteinl 
(shaded squares), (b) The identities of the various proteins represented 
by the squares are indicated by the numbers in correspond.ng position, 
on (a); these refer to Table 4. 
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Figure 9. Montage showing effects in lb* 
region of MSN:413.7he monuge sho«* 
small window into one ponion of the 3-D 
pattern, one row of windows for each eij*" 
rimental group, and one panel for each f* 1 
in the experiment. The left-mosi pattert 
in each row is a group-specific copy of ^ 
master pattern followed by the pa"'"" 
for the five individual rats in the gro^ 
The highlighted protein spots (filled cirr 
les) are spot 4 13 (on the right of etch p»»- 
el; identified as cyiosolic HMG-CoA*£ 
thase) and two modified forms of it (I** 
and 933). From the top, the rows <exp**" 
mental groups) are: high cholesterol. con- 
trols, cholestyramine, lovastatin. and I***" 
statin plus cholestyramine. 



lf9l./J.W>-tM 



Daub** of ni b«er proteins 



921 



Regulation of Rat Liver 413 



(PuUli** Cytoooft: KUG-CoA Synthaat, S3kd) 
Tort Compounds an DM 




5.000 



10.000 15.000 20.000 

Protein Abundance 



25.000 



30.000 



Ftgure 10. Bargraph showing the quantita- 
tive effects of various trcatmcnis on the 
abundance orMSN:413 (cyiosolic HMG- 
CoA synthase) in the gets of Fig. 9. 




Figure 11. Bargraphs of a series of six core- 
gulated spots including MSN:413. in the 
bargraphs. the abundances of the appro- 
priate spot (master spot number shown at 
the top of the panel) in each animaJ are 
shown. The five five*animal groups are in 
the order (left to right): high cholesterol, 
controls, cholestyramine, lovastatin. and 
lovastatin plus cholestyramine. Each bar 
within a group represents one experimen- 
tal animal liver (one 2-D gel). Note the cor- 
related expression of the 6 spots, espe- 
cially in the two far right (most strongly in- 
duced) groups. 



Amtonan ami. 




'-*J9 




•III 




I i i I 



II Li. 

II mi 




Ftfvre i:. Data on a second coreiulatr.' 
group of spots, presentee as m Ftc.I] T:.. 
fourth experimental group tlovasu::r 
shows a modes: induction, while ihe fifr 
group (lovastatin plus cholesiyrammj. 
does not. 




Figure U. Data on spot MSN:367. presented as in Fig. U This P**£ 
shows unambiguously ihe an ti -synergistic effect of lovastatin and 
tyramine (fifth group) as compared to lovastatin (fourth group)* 7 *'* 
ponse contrasts strongly with the regulation pattern seen in Fig- 
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Y CPKtf SDSMW 



MSN 



Y CPK4 



n 

15 
17 
18 
19 
20 
21 
22 
23 
24 
25 



311 
566 
612 
549 
645 
620 
906 
755 
649 
1204 
332 
787 
313 
607 

27 1184 

28 1263 



743 
768 
1216 
1145 
1037 
663 
712 
763 
304 
1155 
684 
1318 
1924 
1203 

47 1391 

48 309 
605 
621 

1113 
1820 
725 
2001 
722 
676 



29 
30 
32 
33 
34 
35 
-'36 
38 
39 
41 

42 
43 
44 
46 



49 

.50 

51 
• 52 
"53 

54 

55 

56 

57 1682 

58 1091 

59 1171 

60 1400 

61 1853 

62 1886 

65 735 

66 1263 

67 1252 

68 779 
1064 

656 
638 
1582 



71 
72 
73 

74 1570 

75 1264 

76 1338 

77 1833 

78 1767 

79 925 



80 
61 



534 
1811 



82 1412 

83 1471 

84 1662 

85 1S96 
* 1817 
87 



434 <-3S.0 
263 *24J 
•16.0 
-25.2 
-15 J 
-21.6 
-14.0 
-17.5 
-20.9 
-6.7 



426 
268 
520 
569 
414 
298 
403 
448 



434 <-35.0 

424 -16.6 

417 <-35.0 

516 -16.1 



524 
446 

605 
112 
417 
445 
555 
412 
606 
694 



607 
589 
362 
586 
447 



535 

522 

499 

177 

500 

630 

533 

302 

580 

565 

624 

508 

567 

297 

312 

407 

692 

296 

569 

545 

583 

556 

621 

564 

363 

565 

738 

696 

363 

681 

347 

563 

479 

301 



516 1371 
1589 696 



90 
91 
92 



1706 
651 
1415 
1773 



93 1338 

94 1708 



719 
329 
710 
545 
446 
696 



-9.0 
-8.0 
-17.8 
-17.2 
4.6 
-9.5 
-11-3 
-14.9 
-18.7 
-17 3 



470 <-35.0 

-19.6 
-7.3 



-0 1 
-8.7 
-6.3 



454 <-35.0 
587 -22.5 



-21.8 
-10.0 
•0.9 
-18.3 
>0.0 
•18 4 
•19.8 
•2.5 
-10.3 
4.2 
-6.2 
•0.6 
•0.4 
•18.1 
•6.0 
•8.1 
•16.8 
-10.8 
•20.6 
-21.2 
•3.6 
•3.6 
-B.0 
-7.0 
-0.6 
-1.5 
•13.6 
•26.1 
•1.0 
•6.0 
•5.0 
-2.7 
-34 
-0.9 
-27.0 
-3.5 
-2.2 
•20.6 
-6.0 
-1.4 
•7.0 
2-2 



63.600 
102.900 
64.800 
101.000 
56.200 
50.000 
66.300 
90.200 
67.900 
62.100 
63,600 
65.000 
66.000 
55.500 
54.900 
62.400 
49.000 
348.600 
66.000 
62.500 
52.400 
66.600 
48.900 
43.800 
59.600 
51.400 
48.800 
50.000 
74,600 
50.200 
62.300 
61.500 
50.100 
53.900 
55.000 
57,000 
170.800 
56.900 
37.300 
54.100 
89.000 
50.60C 
50.300 
47.800 
56.200 
51.500 
90.500 
65.900 
67.300 
43.900 
90.80C 
50.000 
53.100 . 
50.400 
52.300 
48.000 
51.800 
74.400 
51.700 
41.600 
43.600 
74.500 
44.500 
77.500 
51.600 
56.900 
89.100 
17.400 
43.600 
42.500 
61.700 
43.000 
53,200 
62.300 
43.700 



95 
96 



1119 
1731 



97 1033 
96 1406 



536 
756 
566 
565 
578 1149 





2004 


538 


101 


1106 


623 


1QB 


482 


455 


103 


665 


630 


104 


773 


1182 


105 


312 


1117 


106 


1769 


509 


10/ 


1565 


720 


106 


1692 


807 


109 


1462 


593 


110 


778 


516 


111 


1726 


700 


113 


1191 


680 


114 


1296 


185 


115 


682 


907 


116 


1146 


610 


117 


1548 


. 849 


116 


1060 


577 


120 


1530 


628 


121 


838 


423 


122 


1572 


712 


123 


23 


1433 


124 


621 


1474 


125 


1298 


862 


126 


872 


921 


127 


1000 


717 


128 


1229 


311 


129 


1422 


832 


130 


1776 


499 


131 


1930 


757 


132 


660 


S37 


133 


666 


1019 


134 


1271 


862 


1 Ji 


1161 


1389 


136 


4 S3 


1063 


1 ^7 

i j/ 


1658 


823 


138 


1504 


697 


* ^fi 

1 39 


1488 


707 


140 


1689 


756 


141 


31 1 


1417 


142 


1366 


915 


143 


1429 


346 


144 


615 


1017 


145 


2006 


566 


146 


2006 


518 


147 


1070 


1106 


148 


1347 


578 


149 


541 


1481 


150 


1645 


760 


151 


1269 


236 


152 


1507 


911 


153 


1722 


448 


154 


932 


503 


15S 


1031 


294 


156 


1970 


684 


157 


1258 


163 


158 


1275 


417 


159 


1663 


820 


160 


1034 


527 


161 


1953 


771 



162 1020 

164 1566 

166 1905 

167 1340 

168 1506 

169 1336 

170 1969 



171 
172 
173 



800 
476 
919 



1482 
806 
565 
181 
583 
678 
S41 
378 
958 

1314 



-9.9 
-2.0 
-11.4 
-6.1 
•23.8 
>0.0 
-10.1 
•28.5 
•20.2 
-17.0 
<-35.0 
-1.5 
•3.6 
-2.4 
-4.8 
-16.9 
-2.0 
•8.9 
-7.5 
-19.6 
-6.5 
-4.1 
-11.1 
-4.3 
-15.4 
-3.8 
«-35.0 
•21.9* 
-7.5 
-14.7 
•12.0 
-84 
-5.8 
-1 4 
-0.1 
-20.4 
-20.2 
-7.9 
•9.3 
-29.7 
-0.6 
■46 
•4.8 
-2.4 
<-35.0 
-6.7 
-5.7 
-22.1 
>0.0 
>0.0 
-10.7 
-6.9 
-25.7 
•2.8 
•7.9 
-4.5 
-2.1 
•13.5 
-11.4 
>0.0 
•8.1 
-7.8 
2 6 
-11.4 
>0.0 
-11.6 
•3.8 
-0.2 
-7.0 
•4.6 
-7.0 
>0.0 
-16 Z 
•28.7 
-13.7 





MSN 


X 


Y 


CPKoJ 


SOSMW 


53.600 


174 


1364 


183 


•6.7 


162.9X 


40.700 


175 


825 


393 


•15.7 


69.3a 


51.600 


177 


1582 


553 


-3.6 


52.6a 


51,700 


178 


1321 


710 


-7.2 


43.0a 


25.000 


179 


1089 


615 


-10.4 


48.3a 


53.700 


180 


1866 


567 


•0.5 


51 .6a 


47.900 


181 


411 


295 


-32.1 


91.2a 


61.300 


182 


804 


730 


-16.2 


42.0a 


37.300 


184 


1860 


896 


-0.6 


* 34.5a 


23.800 


185 


1997 


1017 


>0.0 


29.8a 


26.100 


186 


279 


1113 


<-35.0 


26.3a 


56.100 


187 


773 


296 


-17.0 


90.8a 


42.500 


188 


1536 


807 


-4.2 


38400 


38,300 


191 


1560 


674 


-3.9 


44.9a 


49.700 


192 


1818 


687 


-0.9 


44 200 


55.500 


193 


1460 


555 


-5.0 


52.4a 


43.500 


194 


1380 


266 


•64 


ioi.6a 


44.500 


195 


784 


632 


•16.7 


47.3a 


160.800 


196 


1227 


1185 


•84 


23.7a 


34.10C 


197 


667 


553 


-20.1 


52.6a 


48.700 


198 


2006 


681 


>0.0 


44KX) 


36.50C 


199 


1711 


674 


-2.2 


44.9a 


50.80C 


2a 


872 


424 . 


-14.7 


65.0a 


37.40C 


201 


292 


435 


<-35.0 


63.700 


CS.2CC 


ZOZ 


736, 


253 


•18.0 


i07.ea 


42.90C 


203 


786 


829 


•16.7 


37.4a 


15.30C 


204 


1224 


569 


•8.5 


50.oa 


13.90C 


205 


439 


963 


-30.9 


3i.ia 


36.00C 


206 


1994 


571 


>0.0 


51.3a 


33.5a 


207 


1895 


687 


•0.3 


44.2a 


42.6a 


206 


240 


1418 


<-35.0 


i5.sa 


86. 10C 


210 


17a 


499 


-2.3 


57.oa 


37.30C 


211 


902 


517 


-14.1 


55.4a 


57.00C 


213 


1067 


684 


•10.4 


44.4a 


40.7a 


214 


1340 


668 


-7.0 


45.2a 


53.8a 


215 


1501 


495 


•3.5 


57.3a 


29.7a 


216 


1565 


755 


-3.6 


40.7a 


36.00C 


217 


1159 


393 


-9.3 


69.3a 


i6.aa 


218 


931 


572 


•13.5 


51.2a 


28,i a 


219 


713 


177 


-18.7 


1 70.5a 


37.7a 


220 


1479 


911 


-4.9 


33.9a 


43.7a 


221 


965 


927 


-12.8 


33.3a 


43,2a 


223 


934 


716 


-13.5 


42.7a 


40.7a 


225 


1812 


1045 


-1.0 


28.600 


15.8a 


226 


821 


411 


•15.8 


66.600 


33.8a 


227 


1586 


1483 


-3.6 


i3.6a 


77,9a 


228 


1065 


567 


-10.8 


5i.6a 


29.8a 


229 


1577 


890 


-3.7 


34.8a 


5i.6a 


230 


1458 


496 


-5.2 


57.3a 


55.3a 


232 


1440 


849 


-5.5 


36.5a 


26.5a 


234 


1692 


489 


-24 


57.9a 


50,8a 


235 


618 


1004 


•22.0 


30.3a 


13.7a 


236 


920 


1138 


-13.7 


25.4a 


40.5a 


237 


952 


1006 


•13.1 


30.2a 


117.0a 


238 


1611 


541 


-3.2 


53.5a 


33.900 


239 


1489 


720 


-4.8 


42.5a 


62.1a 


240 


501 


448 


-27.7 


62.1 a 


56.600 


. 241 


1820 


569 


-0.9 


51.4a 


9i.4a 


242 


1357 


658 


-6.6 


45.600 


44.400 


243 


711 


1162 


-18.7 


23.600 


i62.4a 


244 


1855 


621 


•0.6 


48,0a 


65,9a 


245 


1189 


474 


•8.9 


59.3a 


37,ea 


246 


551 


459 


-25 1 


61 ,oa 


54.600 


247 


1348 


604 


-6.9 


49.1a 


40.000 


248 


460 


448 


•29.3 


62.1a 


13.7a 


249 


1733 


451 


•1.9 


6i. sa 


38.400 


250 


1974 


788 


>0.0 


39.2a 


5i,7a 


251 


806 


392 


-16.1 


69.5a 


164.900 


252 


874 


553 


•14.6 


S2.SO0 


50.4a 


253 


7S3 


848 


-17.6 


36.5a 


44.7a 


254 


995 


450 


-12.1 


61.9a 


53.5a 


255 


1690 


679 


-24 


44.6a 


7i.aa 


256 


994 


1006 


•12.1 


30.2a 


32,i a 


257 


506 


464 


-274 


60,4a 


19.3a 


258 


1517 


820 


-4 4 


37.8a 



*Utter uble of proteins in the rat liver database, show; 



Predicted molecular mass (from the standard curve of Fig. 8) 



inf spot master number. f el position (x and y), isoelectric point relative to CPK standards, and 



924 



MSN 



L AAdcfioo nsL 



Y CPIW SOSMW 



r l«* 



MSN 



Y CPKof SOSMW 



2S0 
260 
261 
262 
263 
265 
266 
267 
266 
268 
270 
271 
272 
274 
275 
276 
277 
278 
279 
281 
282 
283 
284 
285 
286 



1706 
661 
1725 
406 
1063 
1300 
510 
660 
430 
1044 
2010 
6S7 



269 
290 
291 
292 
293 
294 
295 
296 
297 
299 
300 
301 
302 
303 
304 
305 
306 
307 
308 
309 
310 
311 
312 
313 
314 
315 
318 
. 320 
321 
322 
323 
324 
325 
326 
327 
328 
330 
331 
332 
333 
334 
335 
336 
336 
339 
340 
341 
343 
344 



1292 
. 1350 
1670 
688 
961 
679 
1846 
1505 
1313 
1314 
1332 
1277 
1301 
ii4? 
92S 
787 
1462 
531 
860 
1162 
218 
1377 
913 
2012 
702 
494 
403 
1843 
1049 
1606 
1219 
1627 
1524 
1760 
1609 
266 
1002 
1316 
1341 
1104 
1480 
850 
1454 
670 
655 
1521 
1567 
1386 



1606 
1566 
531 
784 
1050 
1593 
1616 
1854 
1265 
581 
1407 
1351 
1813 



061 
1361 
670 
1127 
172 
673 
437 
1038 
061 
606 
853 
422 
968 
712 
500 
1089 
536 
718 
570 
1084 
525 
1147 
629 
408 
652 
624 
570 
511 
1476 
818 
449 
698 
609 
814 
979 
15Z3 
667 
178 
1280 
1008 
1585 
5S3 
969 
916 
755 
892 
102B 
1451 
1408 
1365 
1395 
523 
1053 
1456 
603 
1494 
626 
101 
675 
677 
406 
1291 
751 
607 
471 
1156 
■407 
303 
506 
1004 



585 
1047 
265 
540 



-1.1 
20.4 
-2.0 
28.0 
•10.0 
-6.3 
-27.3 
•20 4 
-31.0 
•11.2 
>0.0 
-15.0 
-14.2 
-7.6 
-6.9 
-2.6 
-10.4 
-13.0 
-14.5 
-0.7 
-46 
-7JJ 
-7.3 
-7.1 
-7.6 
-6.3 
-0.5 . 
-13.6 
-16.6 
-5.1 
-26.3 
-14.0 
-9.3 
<-35.0 
-6.5 
-13.9 
>0.0 
-19.0 
-28.1 
-32.6 
-0.7 
-11.1 
-3.3 
-8.5 
-3.0 
-44 
-1.5 
-3.3 
<-3S.O 
•0.3 
-7.3 
-7.0 
-10.1 
-4.9 
-15.1 
-5.3 
•20.0 
-20.6 
-4.4 
-36 
-6.3 
-30.0 
-3J 
-3 8 
-26.3 
-16.7 
-10.9 
-3.5 
-3.2 
-0.6 
-6.0 
-23.6 
-4.7 
-6.8 
-0.9 



31.600 
17.700 
44.600 
25.800 
177.400 
45.000 
63.400 
29.000 
31.000 
46.900 
36.300 
65.200 
31.700 
42.900 
49.900 
27.100 
53.700 
42,600 
51.300 
27.300 
54.800 
25.100 
37.400 
67.200 
46.100 
37.600 
50.700 
55.000 
13.000 
37.800 
62.000 
43.600 
48.700 
38.000 
31,300 
12.400 
45.300 
169.200 
20.400 
30.100 
10.300 
49.600 
30.900 
33.700 
40.700 
34.700 
29.400 
14,700 
16.100 
17.600 
16.600 
54.900 
28.500 
14.400 
49.100 
13.300 
47.700 
420.500 
44.600 
44.700 
67.000 
20,100 
40.000 
43.700 
50.600 
24.700 
67.300 
88.500 
40.400 
3O.3O0 
34.000 
50.300 
28.700 
102^00 
52.800 



345 
346 
347 
348 
340 
350 
351 
352 
353 
354 
355 
356 
357 
356 
350 
360 
361 
362 
363 
364 
365 
366 
367 
368 
369 
370 
371 
372 
373 
374 
375 
376 
377 
378 
379 
381 
382 
383 
364 
385 
386 
387 
388 
389 
390 
391 
392 
393 
394 
395 
396 
397 
399 
400 
401 
403 
404 
405 
406 
409 
410 
411 
412 
413 
415 
416 
417 
418 
419 
420 
421 
422 
423 
424 
425 



1006 
1095 
625 
361 
110 
521 
912 
1574 
061 
706 
1450 
1374 
474 
796 
764 
1384 
1713 
1161 
. 914 
412 
741 
678 
1560 
963 
434 

1567 
1875 
1351 
1506 
1823 
254 
1409 
621 
1017 
953 
656 
1252 
1699 
1042 
1490 
1554 
1193 
1374 
1456 
718 
1799 
1482 
1227 
1530 
1410 
912 
1465 
1473 
1029 
1516 
1495 
1525 
723 
650 
1501 
936 
350 
1033 
737 
1578 
646 
1695 
725 
1289 
1171 
599 
929 
739 
1490 



578 
640 

728 
063 

1343 

1130 
610 
530 
012 
762 
630 

1152 
007 
346 
338 

1068 
766 
658 

1156 
435 
486 

1503 
035 
520 

441 

610 

860 

762 
1050 

715 

532 

417 

583 

494 

595 

598 

674 

258 
1518 

493 

563 

603 



902 
969 
690 
732 
758 
1461 
577 
755 
256 
1063 
450 
1140 
754 
554 
1092 
252 
663 
478 
1057 
1120 
538 
425 
606 
496 
482 
770 
1041 
912 
162 
856 
625 
965 



-11.0 
-10.3 
-21.7 
-35.3 
<-35.0 
-26.7 
-13.0 
-3.7 
-12.0 
-18.0 
-5.3 
-6.5 
-28.7 
-16.3 
-17 J 
-6.4 
-2.1 
•0.3 
-13.8 
-32.0 
-17.0 
-14.6 
•3.0 
-124 
-31.0 
-21.2 
-3.6 
-0.5 
-6.8 
-4.6 
-0.9 
<-35.0 
-6.1 
-21.8 
-11.7 
-13.1 
-15.0 
•8.1 
-2.3 
-11.2 
-4.7 
-4.0 
-B.9 
-6.5 
-5.2 
-18.5 
-1.1 
-4.8 
•8.4 
-4.3 
-6.0 
-13.9 
-5.0 
-4.9 
-11.5 
-44 
-4.7 
-4.3 
-184 
-20.8 
-4.6 
-13 4 
-35.9 
-11.4 
-18.0 
•3.7 
-21.0 
-2.3 
-18.3 
-7.7 
-9.1 
•22.8 
-13.6 
-17.9 
-4.7 



50.800 
46.800 
42.000 
31.100 
18.300 
25.700 
48.100 
54.300 
33.900 
40.400 
37.300 
24.900 
30.600 
77.800 
79.400 
27.900 
40.100 
36.100 
24.800 
63.700 
58.200 
13.000 
33.000 
55.200 
63.000 
46.700 
36.100 
40.400 
28.300 
42.700 
54.200 
65.900 
50.400 
57.500 
49.600 
49.400 
44.900 
105.300 
12.500 
57.500 
50.400 
49.100 
67.700 
34,300 
31.700 
44,000 
41.900 
40.600 
14.400 
50.800 
40.800 
106.400 
28.100 
61.900 
25.300 
40.800 
52.500 
27.100 
108.000 
45.500 
59,000 
28.300 
26.000 
53,700 
64.900 
48.900 
.57.300 
58.600 
40.000 
28,900 
33.900 
193.700 
36.200 
47,700 
31.600 




426 
427 
428 
429 
430 
431 
432 
434 
435 
436 
437 
438 
439 
440 
441 
443 
446 
447 



449 

'450 
451 
452 
453 
454 
\ 456 
457 
459 
460 
461 
462 
463 
464 
465 
466 
468 
469 
470 
471 
472 
473 
474 
475 
476 
477 
478 
479 
480 
482 
483 
485 
486 
487 
486 
489 
490 
491 
492 
493 
494 
495 
496 
497 
499 
500 
501 
502 
503 
504 
505 
506 
507 
508 
509 
510 



1 

810 

1565 
1250 
1253 
734 
483 
518 
1020 
1122 
1670 
435 
66 
1740 
509 
743 
801 
1050 
1245 
1576 
1818 
1094 
1945 
1652 
1403 
1394 
005 
1038 
1596 
1528 
1096 
849 
1614 
1388 
1194 
577 
1140 
1797 
1293 
618 
2009 
1205 
1035 
160 
469 
599 
1009 
1216 
816 
603 
1606 
47B 
1025 
1045 
1609 
775 
692 
1100 
1760 
862 
470 
494 
980 
1414 
1234 
1246 
824 
1246 
1115 
1189 
1578 
787 
979 
1153 
1730 



704 


•7.t 


843 


•16.0 


303 


.3 Q 
M.» 


847 


•*o 


562 


tJ.l 


1426 


•1* 1 


433 




1041 




1170 




106 


-Q ft 


673 




1102 


-31 .0 


847 


<-35.0 


544 


•1.6 


1571 


•22.8 


335 


•17.8 


666 


-16.2 


026 


•11.1 


1296 


-8.2 


1516 


-3.7 


1021 


•0.9 


440 


-10.3 


802 


>0.0 


894 


•2.8 



500 

>*ie 

436 
581 
294 
863 
1137 
1125 
1072 
481 
1084 
467 
688 
524 
1133 
655 
299 
215 
788 
155 
1370 
662 
540 
235 
346 
673 
1013 
599 
607 
1186 
301 
1289 
178 
964 
776 
247 
1258 
1436 
852 
546 
1072 
659 
792 
1134 
1407 
391 
402 
250 
552 
616 
1006 



-6.3 
•14.0 
•11.3 
34 
-4.3 
•10.2 
•15.2 
-0.9 
-6.3 
-8.9 
•23.9 
-9.6 
-1.1 
-7.6 
-21 .9 
>0.0 
-8.7 
-114 
<-3S.O 
•289 
•22.8 
-11.8 
-8.6 
-15.9 
•19.3 
-3.3 
-28.6 
-11.5 
-11.2 
-33 
-17.0 
-19.3 
-10.2 
-1.6 
-14.5 
-28.9 
-28.1 
-12.5 
-6.0 
-83 
-6.2 
-15.7 
-B.2 
-9.9 
-8.9 
-3,7 
-16.6 
-12.5 
-94 
-2.0 



36.6Q0 

63.90C 

147.0X 
<5.0Ct 
26.7* 
36.60C 
S32CC 
10. BEX 
80.1 Qc 
4S2DC 
J3 3CC 
19.80C 
12.6K 
29.6QC 
63.10C 
38.8tt 
34.800 
56.SQC 
42.800 
63 SCC 
50 500 
91.4GC 
3590C 
25 43C 
25.80C 
27.800 
56.700 
27.300 
60.100 
34.900 
54.800 
25.500 
46000 
89.000 
131.300 

39 roc 

2O7.60C 
17.400 
45.800 
53.500 
117.400 
77,800 
44.900 
30.000 
40.300 
48.800 
23.700 
89 2D0 
20.100 
160.300 
31.100 
30.700 
110.700 
71^0 
15JDD 
3*400 

5X160 
27.800 

4&WD 
39.0X 
2S5DC 
1&200 



loaooo 

S2.6* 
4**» 
3OJ0O 



800 
[1000 
(l696 
948 
481 
['1334 
868 
708 
822 
632 
11332 
603 
>H90 
479 
768 
747 
[1170 
11502 
1728 
507 
870 
:i347 
H513 
306 

1463 
It 000 
[: 625 
I 1164 
803 
1250 
856 
803 
1162 
128 
,1355 
605 
[1360 
.002 
fc112S 
705 
1477 
080 
700 
1028 



T1510 
?1212 
780 
618 
[1142 
532 
771 
M066 
,622 
014 
1064 
11824 
J.1302 
| 062 
11487 
It 756 
*667 
030 
^1688 
642 

65 
1014 

fc732 
L1627 
[1000 



r 



a 1991. /J.W7-9J0 



Daubuc of rsi |.» t r pnnems 



925 



Y CPKd SOSMW 



M»< 



Y CPKol 



SOSMW 



MSN 



Y CPKol SOSMW 



511 



aoo 



512 1009 

513 1«6 
514 
515 



048 

461 



516 
517 
518 
510 
SO 

521 1332 
SZ2 603 
52) 1100 
524 470 



666 
706 
622 
632 



S25 
526 



768 
747 



527 1170 

526 1502 

530 1726 

532 507 

533 670 

534 1347 

535 1513 

536 306 

538 1851 

539 1463 
$40 009 
541 625 
$42 1164 
543 603 
$44 1259 

545 656 

546 803 
$47 1162 
546 126 

540 1355 
550 505 

552 1369 

553 992 

555 1125 

556 705 
$57 1477 
$56 060 
558 700 
560 1028 
562 696 
564 
565 
566 



789 
777 
060 
567 1510 
569 1212 



570 
571 



760 
616 



$73 1142 

574 532 

575 771 
STB 1068 



577 
S7B 



622 
014 



1064 
$60 1524 



S61 
562 



1302 
982 



56* 1487 
565 758 



566 
567 

sea 

569 



667 
030 
1688 

642 
*0 1317 

£ « 
* 1014 

2* 732 

1627 

r 5 ioo9 



484 

533 
1034 
636 
543 
1044 
1021 
770 
670 
165 
630 
1104 
300 
1226 
1066 
1016 
231 
542 
620 
1011 
489 
1085 
346 
654 
689 
082 
561 
289 
106 
655 
1143 
1526 
1071 
274 
1321 
1122 
866 
404 
405 
410 
975 
1030 
583 
1109 
621 
704 
1446 
766 
328 
611 
661 
504 
056 
771 
787 
250 
534 
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-21.3 


1095 


1394 


-10.3 


1719 


1545 


•2.1 


791 


666 


-16.5 


964 


1021 


-12.9 


313 


195 


<-35.0 


306 


194 


<-35.0 


320 


197 


<-35.0 


326 


197 


<-35.0 


394 


294 


-33.2 


402 


294 


-32.7 


386 


294 


-33.7 


641 


329 


-21 .2 


660 


329 


-20.4 


914 


266 


-13.8 


873 


245 


-14.7 


970 


372 


-12.7 


1021 


296 


-11.6 


1392 


205 


-6.3 


1354 


203 


-6.8 


1362 


205 


-6.7 


673 


540 


-19.9 


614 


542 


•22.1 


603 


539 


-22.6 


696 


623 


-19.2 


707 


628 


-18.9 


475 


447 


-28.7 


466 


1282 


-29.0 


759 


1461 


•17.4 


1324 


1170 


•72 


1583 


1005 


-3.6 


1865 


609 


-0.6 


1612 


817 


-1.0 


1411 


703 


-6.0 


1392 


682 


-6.3 


794 


410 


-16.4 


769 


407 


•17.1 


740 


406 


-17.9 


743 


511 


-17.8 


713 


510 


-18.7 


662 


509 


-19.6 


663 


SO* 


-20.3 


565 


582 


•24.4 



SOSMW 



24.700 
35.900 
68.400 
68.800 
66.700 
54.500 
54.500 
54.800 
55.700 
55.000 
50.200 
53.700 
43.400 
124.900 
124.900 
125.100 
125.200 
124.700 
164.400 
162.600 
164,300 
131.800 
94.200 
26.200 
34.700 
20.000 
20.600 
19.400 
20.000 
13.000 
16.300 
16.200 
15.400. 
16.600 
11.600 
45.200 
29.700 
148.700 
149.800 
147.400 
146.600 
91.400 
91.200 
91.400 
81.600 
81.600 
101.800 
112.000 
72.900 
90,100 
139.500 
141.800 
139.500 
53.600 
53.400 
53.600 
47.800 
47.500 
62.300 
20.400 
14.400 . 
24.200 
30.300 
38.200 
37.900 
43.400 
44,500 
66.900 
67.300 
67,500 
55.900 
56,000 
56,100 
56.500 
50.500 



MSN 




v 

T 


CPKpt 


SOSMW 


1246 


547 


577 


-25.3 


50.600 


1247 


530 


576 


•26.3 


50.000 


1249 


516 


572 


-27.0 


51200 


1250 


073 


536 


-12.7 


53.900 


1251 


607 


532 


-22 4 


54.200 


1252 


665 


529 


-20.2 


54.400 


1253 


899 


766 


-14.1 


40.200 


1254 


1311 


746 


-7.4 


41.200 


1255 


1300 


761 


-7.5 


40.400 


1257 


1938 


712 


0.0 


42.900 


1256 


1806 


718 


-1 .0 


42.600 


1259 


1727 


715 


-2.0 


42.700 


1260 


1629 


713 


-3.0 


42.800 


1261 


1555 


717 


-4.0 


42.600 


1262 


1468 


717 


•5.0 


42.600 


1263 


1413 


722 


-6.0 


42.400 


1264 


1340 


717 


•7.0 


42.600 


1265 


1263 


717 


•6.0 


42.600 


1266 


1182 


720 


•9.0 


42.500 


1267 


1110 


717 


•10.0 


42.600. 


1268 


1055 


717 


-11.0 


42.600 


1269 


999 


717 


-12.0 


42.600 


1270 


959 


715 


-13.0 


42.700 


1271 


905 


712 


•14.0 


42.900 


1272 


857 


714 


•15.0 


42.800 


1273 


Rio 


70S 


- ■ w.v 


43.300 


1274 


774^ 


711 


-17.0 


42.900 


1277 


737 


706 


•18.0 


43.100 


1278 


702 


711 


•19.0 


42.900 


1279 


671 


710 


•20.0 


43.000 


1260 


645 


710 


-21 .0 


43,000 


1281 


617 


707 


-22.0 


43.100 


1282 


595 


704 


-23.0 


43 300 


1283 


573 


TOO 


-24.0 




1284 


552 


695 


-25.0 


43,700 


1265 


536 


694 


-26.0 




1286 


515 


687 


•27.0 


44.200 


1287 


496 


663 


•28.0 


44.400 


1266 


467 


669 


-29.0 


45.200 


1289 


447 


667 


-30.9 


45,300 


1290 


427 


655 


•31.0 


45.900 


1291 


412 


655 


-32.0 


45.900 


1292 


397 


652 


-33.0 


46.100 


1293 


381 


654 


-34.0 


46.000 


1294 


365 


653 


-35.0 


46.100 


1295 


348 


653 


<-35.0 


46.100 
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e 3. Computed p/s of rw 0 ten orcubtmjUut 



D*ub«sc of m lt«cr protein* 



hemoglobin (Hb) 



Protein Name 



protein sundards: Rabbit muscle CPK and human 



P1R 8ASP #GLU 
Name 3.9 4.1 



Rabbit musde CPK KIRBCM a 27 

28 27 
» 27 
28 27 
28 27 
28 27 
28 27 
28 27 
28 27. 
28 27 
26 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

ZS 2/ 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

28 27 

7 8 

7 8 

7 8 

7 8 

7 8 

7 8 

7 8 

7 8 

7 8 

7 8 

7 8 

7 8 

7 8 



#HIS 

6.0 

TT 

17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 
17 

9 
9 
9 
9 
9 
9 
9 
9 
9 
9 
9 
9 
9 



ftYS 
10.8 

3T 
33 
32 
31 
30 
29 
28 
27 
26 
25 
24 
23 
22 
21 
20 
19 
18 
17 
16 
15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 
0 
0 



•ARG NH2- Caic Real 
12.5 7.0 ol CPK 



11 
10 
9 
6 
7 
6 
5 
4 
3 
2 
1 
0 
0 



18 
18 
16 
18 
18 
18 
18 
18 
16 
18 
18 
18 
18 ' 
16 
18 
16 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
18 
16 
18 
18 
18 
18 
18 
18 

3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 
3 



6.64 
6.67 
6.54 
6.42 
6.31 
6.21 
6.12 
6.03 
5.94 
5.85 
5.76 
5.67 
5.58 
5.48 
5.39 
5.29 
5.20 
5.12 
5.04 
4.96 
4.89 
4.83 
4.77 
4.71 
4.66 
4.61 
4.56 
4.52 
4.48 
4.44 
4.40 
4.36 
4.32 
4.29 
4.25 
4.22 



0.0 
-1 
-2 
•3 
-4 
•5 
-6 
•7 
-8 
-9 
-10 
-11 
-12 
-13 
-14 
-15 
-16 
-17 
-18 

' I 9 - 

-20 

-21 

-22 

-23 

-24 

-25 

•26 

-27 

-28 
-29 
-30 
-31 
-32 
-33 
-34 
-35 



7.18 

6.79 

6.53 

6.32 

6.13 

5.96 

5.78 

5.59 

5.37 

5.14 

4.91 

4.71 

4.54 



-1.8 
-3.2 
-5.3 
-7.2 
-10.0 
-12.3 
-15.5 
-18.0 
-21.0 
-25.5 
-27.2 
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Table 4. Computed pf i of some fcnovn proteins related to measured CPK p/s 





Protein Name 


PIR 
Name 


#ASP #GLU #HIS #LYS #ARG 
3.9 4.1 6X 10.8 lis 


0 


Creatine phospho kinase (CPK). raobrt musde 


. KIRBCM 


CO 


57 


i / 


04 


18 


1 


Fatty acid-binding protein, rat hepatic 


. FZRTL 


c 
O 


i J 


a 


1 c 

Id 


2 


2 


b2-microgtot>ubn. human 


MGHUB2 


/ 


p 

0 


4 


8 


5 


3 


Carbamoyl-phosohate synthase, rat 


SYRTCA 


70 

fd 




oo 
28 


95 


56 


4 


Proalbumm ( serum albumin precursor), rat 


ABRT5 


^o 
32 


57 


15 


53 


27 


5 


Serum albumin, rat 


ABRTS 


io 
oc 


Of 


15 


53 


24 


5 


Superoxid dismutase (Cu-2n. SOD), rat 


A26810 


o 


1 1 


10 


9 


4 


7 


PhosphoKpase C. phophoinosnioe-specific (?). rat 


A28807 






A 

9 


49 


21 


3 


Albumin, human 


ABHUS 


Jo 




lo 


f A 

60 


24 


9 


Apo A-l bpoprotein. rat 


A24700 


1 o 
1 0 


OA 
*4 


6 


23 


12 




proApo A-l lipoprotein, human 


LPHUA1 


ID 




5 


21 


17 


n 


NAOPH cytochrome P-450 reductase, rat 


R0RTO4 


«* 1 


OU 




0.0 
Jo 


36 


12 


Retinol binding protein, human 


VAHU 


1 o 


in 




1 A 
1U 


14 


13 


Actin beta, rat 


ATRTC 


c.3 


£0 






18 


14 


Actin gamma, rat 


ATRTC 


20 


29 


9 


19 


18 


15 


Apo A-l lipoprotein, human 


LPHUA1 


16 


30 


5 


21 


16 


16 


Apo A-IV lipoprotein, human 


LPHUA4 




43 


B 
O 


9ft 




17 


Tubulin alpha, rat 


UBRTA 


27 


37 


13 


19 


21 


10 


F 1 ATPase beta, bovine 


PWBOB 


25 


36 


9 


22 


22 


19 


Tubulin beta, pig 


UBPGB 


26 


36 


10 


15 


22 


20 


Protein disuiphide i so me rase (PDi). rat hepatic 


ISRTSS 


43 


51* 


11 


51 


9 


21 


Cytochrome b5. rat 


CBRT5 


10 


15 


6 


10 


4 


22 


Aoo C-ll liooorotein. human 


LPHUC2 


4 


7 


0 


6 


1 




Amino aod pi assumed in calutation: 




3.9 


4.1 


6.0 


10.8 


12.5 



01 



fit* 

CPK 



6.84 
7.83 
6.09 
5.97 
5.98 
5.71 
5.91 
5.92 
5.70 
5.32 
5.35 
5.07 
5.04 
5.06 
5.07 
5.10 
4.88 
4.66 
4.80 
4.49 
4.07 
4.59 
4.44 



04 

-3.0 
•5.0 
-5.5 
•62 
•9.0 
•9.2 
•9.2 
-11.9 
•13.7 
•HJ 
•15.6 
•16.9 

-16.fi 
-17.5 
•197 
•19.6 
-21.0 

•25.0 
•26.0 
•30.5 



